(57) Abstract: The invention relates generally to genes that encode proteins that
inhibit axonal growth. The invention relates specifically to genes encoding NgR
protein homologs in humans and mice. The invention also includes compositions and
methods for modulating the expression and activity of Nogo and the NgR proteins.
Specifically, the invention includes peptides, proteins and antibodies that block
Nogo-mediated inhibition of axonal extension. The compositions and methods of the
invention are useful in the treatment of cranial or cerebral trauma, spinal cord
injury, stroke or a demyelinating disease.

NOGO RECEPTOR HOMOLOGS



FIELD OF THE INVENTION 
The invention relates to neurology and molecular biology. More particularly,
the invention relates to CNS neurons and axonal growth.

BACKGROUND 

Among the mechanisms through which the cells of an organism communicate
with each other and obtain information and stimuli from their environment are
cell membrane receptor molecules expressed on the cell surface. Many such receptors
have been identified, characterized, and sometimes classified into major receptor
superfamilies based on structural motifs and signal transduction features. The receptors
are a first essential link for translating an extracellular signal into a cellular
physiological response.

Receptors on neurons are particularly important in the development of the
nervous system during embryogenesis. The neurons form connections with target cells
during development through axonal extension of the neurons toward the target cells in
a receptor-mediated process. Axons and dendrites have a specialized region of their
distal tips known as the growth cone. Growth cones enable the neuron to sense the
local environment through a receptor-mediated process and direct the movement of the
axon or dendrite of the neuron toward the neuron's target cell. This process is known
as elongation. Growth cones can be sensitive to several guidance cues, for example,
surface adhesiveness, growth factors, neurotransmitters and electric fields. The
guidance of growth at the cone depends on various classes of adhesion molecules,
intercellular signals, as well as factors that stimulate and inhibit growth cones.

Interestingly, damaged neurons do not elongate in the central nervous system
(CNS) following injury due to trauma or disease, whereas axons in the peripheral
nervous system (PNS) regenerate readily. The fact that damaged CNS neurons fail to
elongate is not due to an intrinsic property of CNS axons, but rather due to the CNS
environment that is not permissive for axonal elongation. Classical grafting
experiments by Aguayo and colleagues (e.g., Richardson et al., (1980) Nature 284,
264-265) demonstrated that CNS axons can in fact elongate over substantial distances
within peripheral nerve grafts, and that CNS myelin inhibits CNS axon elongation.
Therefore, given the appropriate environment, CNS axons can regenerate, implying
that CNS axonal injury can potentially be addressed by appropriate manipulation of the
CNS environment.

The absence of axon regeneration following injury can be attributed to the
presence of axon growth inhibitors. These inhibitors are predominantly associated
with myelin and constitute an important barrier to regeneration. Axon growth
inhibitors are present in CNS-derived myelin and the plasma membrane of
oligodendrocytes that synthesize myelin in the CNS (Schwab et al., (1993) Annu. Rev.
Neurosci. 16, 565-595). Myelin-associated inhibitors appear to be a primary
contributor to the failure of CNS axon regeneration in vivo after an interruption of
axonal continuity, whereas other non-myelin associated axon growth inhibitors in the
CNS may play a lesser role. These inhibitors block axonal regeneration following
neuronal injury due to trauma, stroke or viral infection.

Numerous myelin-derived axon growth inhibitors have been characterized (see,
for review, David et al., (1999) WO995394547; Bandman et al., (1999) U.S. Patent
No. 5,858,708; Schwab, (1996) Neurochem. Res. 21, 755-761). Several components
of CNS white matter, NI35, NI250 (Nogo) and Myelin-associated glycoprotein
(MAG), which have inhibitory activity for axonal extension, have been described as
well (Schwab et al., (1990) WO9005191; Schwab et al., (1997) U.S. Patent No.
5,684,133). In particular, Nogo is a 250 kDa myelin-associated axon growth inhibitor
that was originally characterized based on the effects of the purified protein in vitro
and monoclonal antibodies that neutralize the protein's activity (Schwab (1990) Exp.
Neurol. 109, 2-5). The Nogo cDNA was first identified through random analysis of
brain cDNA and had no suggested function (Nagase et al., (1998) DNA Res. 5,
355-364). Only recently was this Nogo cDNA identified as the cDNA encoding the
250 kDa myelin-associated axon growth inhibitor (GrandPre et al., (2000) Nature 403,
439-444; Chen et al., (2000) Nature 403, 434-439; Prinjha et al., (2000) Nature 403,
383-384).

Importantly, Nogo has been shown to be the primary component of CNS
myelin responsible for inhibiting axonal elongation and regeneration. Nogo's selective
expression by oligodendrocytes and not by Schwann cells (the cells that myelinate PNS
axons) is consistent with the inhibitory effects of CNS myelin, in contrast to PNS
myelin (GrandPre et al., (2000) Nature 403, 434-439). In culture, Nogo inhibits
axonal elongation and causes growth cone collapse (Spillmann et al., (1998) J. Biol.
Chem. 272, 19283-19293). Antibodies (e.g., IN-1) against Nogo have been shown to
block most of the inhibitory action of CNS myelin on neurite growth in vitro
(Spillmann et al., (1998) J. Biol. Chem. 272, 19283-19293). These experiments
indicate that Nogo is the main component of CNS myelin responsible for inhibition of
axonal elongation in culture. Furthermore, in vivo, the IN-1 antibody has been shown
to enhance axonal regeneration after spinal cord injury, resulting in recovery of
behaviors such as contact placing and stride length (Schnell and Schwab (1990) Nature
343, 269-272; Bregman et al., (1995) Nature 378, 498-501). Thus, there is
substantial evidence that Nogo is a disease-relevant molecular target. Agents that
interfere with the binding of Nogo to its receptor would be expected to improve axonal
regeneration in clinical states in which axons have been damaged, and improve patient
outcome.

Modulation of Nogo has been described as a means for treatment of
regeneration for neurons damaged by trauma, infarction and degenerative disorders of
the CNS (Schwab et al., (1994) WO9417831; Tatagiba et al., (1997) Neurosurgery
40, 541-546) as well as malignant tumors in the CNS such as glioblastoma (Schwab et
al., (1993) U.S. Patent No. 5,250,414; Schwab et al., (2000) U.S. Patent No.
6,025,333).

Antibodies which recognize Nogo have been suggested to be useful in the
diagnosis and treatment of nerve damage resulting from trauma, infarction and
degenerative disorders of the CNS (Schnell & Schwab, (1990) Nature 343, 269-272;
Schwab et al., (1997) U.S. Patent No. 5,684,133). For CNS axons, there is a
correlation between the presence of myelin and the inhibition of axon regeneration
over long distances (Savio and Schwab (1990) Proc. Natl. Acad. Sci. 87, 4130-4133;
Keirstead et al., (1992) Proc. Natl. Acad. Sci. 89, 11664-11668). After Nogo is
blocked by antibodies, neurons can again extend across lesions caused by nerve
damage (Schnell and Schwab (1990) Nature 343, 269-272).

SUMMARY OF THE INVENTION

Genes encoding homologs (NgR2 and NgR3) of a Nogo receptor (NgR1) in
mice and humans have been discovered. Various domains in the polypeptides encoded
by the NgR2 and NgR3 genes have been identified and compared to domains in mouse
and human NgR1 polypeptides. This comparison has led to identification of a
consensus sequence (NgR consensus sequence) that characterizes a family of proteins
(NgR family). Based on these and other discoveries, the invention features molecules
and methods for modulating axonal growth in CNS neurons.

The invention provides a polypeptide that contains a polypeptide containing a
tryptophan rich LRRCT domain consisting of the amino acid sequence:

N X1 W X2 C X3 C R A R X4 L W X5 W X6 X7 X8 X9 R X10 S S S X11 V
X12 C X13 X14 P X15 X16 X17 X18 X19 X20 D L X21 X22 L X23 X24 X25 D
X26 X27 X28 C [SEQ ID NO: 19]

wherein X is any protein amino acid or a gap, and the polypeptide does not
include amino acid sequence from residue 260 to 309 of SEQ ID NO: 5
(human NgR1) or SEQ ID NO: 17 (mouse NgR1).
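
By way of illustration only, a consensus of this form can be checked mechanically. The sketch below, in Python, turns the SEQ ID NO: 19 pattern into a regular expression in which each X position matches one of the 20 standard amino acids or nothing (a gap). The names AMINO_ACIDS, CONSENSUS, consensus_to_regex and matches_lrrct are hypothetical, the test sequences are synthetic placeholders rather than any sequence of the disclosure, and the sketch does not apply the exclusion of the NgR1 residues recited above.

```python
import re

# One-letter codes for the 20 standard protein amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# SEQ ID NO: 19 written as 50 tokens; "X" marks a variable position.
CONSENSUS = (
    "N X W X C X C R A R X L W X W X X X X R X S S S X V "
    "X C X X P X X X X X X D L X X L X X X D X X X C"
).split()


def consensus_to_regex(tokens):
    """Build a regex in which each X matches one amino acid or a gap."""
    parts = [f"[{AMINO_ACIDS}]?" if t == "X" else t for t in tokens]
    return re.compile("".join(parts))


LRRCT_PATTERN = consensus_to_regex(CONSENSUS)


def matches_lrrct(sequence):
    """Return True if the whole sequence fits the consensus pattern."""
    return LRRCT_PATTERN.fullmatch(sequence.upper()) is not None


if __name__ == "__main__":
    # Synthetic placeholder: every X position filled with alanine.
    candidate = "".join("A" if t == "X" else t for t in CONSENSUS)
    print(len(candidate), matches_lrrct(candidate))
```

Treating a gap as an optional position is one reading of "X is any protein amino acid or a gap"; a stricter screen could limit how many positions may be empty.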



Preferably, X17 and X23 are (independently) arginine or lysine. In some
embodiments, the amino acid sequence of the LRRCT domain is residues 261-310 of
SEQ ID NO: 2, or residues 261-310 of SEQ ID NO: 2 with up to 10 conservative
amino acid substitutions. In some embodiments, the polypeptide contains the
following NTLRRCT amino acid sequence:

C P X 1 X2 C X3 C Y X5 P X5 X7 T Xg S C X9 Xjo Xjj X|2 Xj^ X|5 Xig P 

■^18 -^19 ^20 -^21 ^22 ^23 R ^24 ^ ^ X25 N X27 I X28 X29 X30 X31 X32 X33 
X34 FX35 ^36 ^38 ^39 ^1 ^42 ^ W X43 X44 S N X^j X45 X^j X48 I X49 

30 X50 X51 X52 F X53 X54 X55 X56 X57 L E X58 L D L X59 D N X^o X^i L 

Xg5 P X^ T F X57 G L Xgg X^ L X^q Xji L X72 L X73 Xj^ C X75 L X75 X77 L Xyg 
X„ Xg, F Xgj G L X,3 L Q Y L Y L Q N Xg, X„ X„ Xg, L X50 D 
X91 X92 F X93 D L X94 N L X95 H L F L H GN X95 X97 Xgg Xioo ^loz
X,o3 Xjo, F R G L Xi„5 Xiofi L D R L L L H N Xjop Xjjo Xiji V H X112 
X113 A F X114 Xii5 L Xjig R L X117 Xjig L X119 L F X120 N X121 L X122 X123 L 

-^124 ^125 ^126 ^127 ^ Xi28 Xi29 L X130 L X132 X133 L R L N X134 N X135 W 

X136 C X137 C R X138 R X139 L W X140 W X141 X142 X143 X144 R X145 S S S X146
V X147 C X148 X149 P X150 X151 X152 X153 X154 X155 D L X156 X157 L X158 X159 X160
D X161 X162 X163 C [SEQ ID NO: 18]



wherein X is any amino acid residue or a gap and wherein the polypeptide is not the
polypeptide of SEQ ID NO: 5 (human NgR1) or SEQ ID NO: 17 (mouse NgR1). For
example, X^ X37 and Xjg may represent a gap. Specific examples of polypeptides of 
the invention are SEQ ID NO: 2 (human NgR2), SEQ ID NO: 4 (mouse NgR3), and
SEQ ID NO: 14 (human NgR3). In some embodiments, the polypeptide contains: (a)
a NTLRRCT domain, and (b) less than a complete CTS domain, provided that a partial
CTS domain, if present, consists of no more than the first 39 amino acids of the CTS
domain. While the polypeptide may contain a functional GPI domain, a functional GPI
domain may be absent, e.g., when a soluble polypeptide is desired. A polypeptide of
the invention optionally includes an amino acid sequence of a heterologous
polypeptide, e.g., an Fc portion of an antibody.

The invention also provides a nucleic acid encoding an above-described
polypeptide; a vector containing the nucleic acid, which nucleic acid may be operably
linked to an expression control sequence; and a transformed host cell containing the
vector. A method of producing a polypeptide of the invention is also provided. The
method includes introducing a nucleic acid encoding the above-described polypeptide
into a host cell, culturing the cell under conditions suitable for expression of the
polypeptide, and recovering the polypeptide.

The invention also provides an antisense molecule whose nucleotide sequence
is complementary to a nucleotide sequence encoding a polypeptide selected from the
group consisting of: a polypeptide consisting of residues 311-395 of SEQ ID NO: 2, a
polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a polypeptide
consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid is from 8 to
100 nucleotides in length, e.g., about 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides. The
invention also provides a nucleic acid encoding such an antisense molecule.
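
As a minimal sketch of what "complementary" means for such a molecule, the Python function below returns the reverse complement of a DNA target segment and enforces the 8 to 100 nucleotide length range stated above. The function name antisense_dna and the example target are hypothetical; the example is not a segment of any SEQ ID NO of the disclosure.

```python
# Watson-Crick pairing for DNA bases.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def antisense_dna(sense, min_len=8, max_len=100):
    """Return the reverse complement of a sense-strand DNA segment.

    The length limits mirror the 8 to 100 nucleotide range given in the text.
    """
    sense = sense.upper()
    if not min_len <= len(sense) <= max_len:
        raise ValueError(f"length {len(sense)} is outside {min_len}-{max_len} nt")
    unknown = set(sense) - set(COMPLEMENT)
    if unknown:
        raise ValueError(f"unexpected bases: {sorted(unknown)}")
    # Complement each base while reading the sense strand backwards (3' to 5').
    return "".join(COMPLEMENT[base] for base in reversed(sense))


if __name__ == "__main__":
    target = "ATGGCCTTAA"  # hypothetical 10-nt target, for illustration only
    print(antisense_dna(target))
```

An RNA antisense molecule would use U in place of T; that variant is omitted here for brevity.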

The invention also provides an antibody that binds to an above-described
polypeptide. Polypeptides or antibodies of the invention can be formulated into
pharmaceutical compositions containing the polypeptide or antibody and a
pharmaceutically acceptable carrier.

The invention also provides a method for decreasing inhibition of axonal
growth of a CNS neuron. The method includes the step of contacting the neuron with
an effective amount of a polypeptide or antibody of the invention.

The invention also provides a method for treating a central nervous system disease,
disorder or injury. The method includes administering to a mammal, e.g., a human, an
effective amount of a polypeptide or antibody of the invention. Exemplary diseases,
disorders and injuries that may be treated using molecules and methods of the
invention include, but are not limited to, cerebral injury, spinal cord injury, stroke,
demyelinating diseases, e.g., multiple sclerosis, monophasic demyelination,
encephalomyelitis, multifocal leukoencephalopathy, panencephalitis,
Marchiafava-Bignami disease, spongy degeneration, Alexander's disease, Canavan's
disease, metachromatic leukodystrophy and Krabbe's disease.

The invention also provides a method for identifying a molecule that binds a
polypeptide of the invention. The method includes the steps of: (a) providing a
polypeptide of the invention; (b) contacting the polypeptide with the candidate
molecule; and (c) detecting binding of the candidate molecule to the polypeptide.

Unless otherwise defined, all technical and scientific terms used herein have the
same meaning as commonly understood by one of ordinary skill in the art to which the
invention belongs. In case of conflict, the present application, including definitions,
will control. All publications, patents and other references mentioned herein are
incorporated by reference.

The materials, methods and examples presented below are illustrative only, and
not intended to be limiting. Other features and advantages of the invention will be
apparent from the detailed description and from the claims.

BRIEF DESCRIPTION OF THE FIGURES

Fig. 1A-1B shows an alignment of NgR2 (SEQ ID NO: 2) and NgR3 (SEQ ID
NO: 4) with the known NgR, NgR1 (SEQ ID NO: 5) and the Consensus Sequence
(SEQ ID NO: 6).

Fig. 2. mNgR3 does not bind hNogoA(1055-1120). COS-7 cells were
transfected with vectors encoding myc-NgR1 or myc-NgR3, fixed, and stained with
anti-myc antibodies or AP-hNogoA(1055-1120).

Fig. 3. An alignment of the amino acid sequences of human NgR1, murine
NgR1, murine NgR3, human NgR3 and human NgR2. Numbering begins with amino
acid #1 of murine NgR3. The consensus sequence is listed below. The LRR NT
domain is indicated by a shaded box; domains LRR 1, LRR 3, LRR 5, and LRR 7 are
indicated by open boxes; LRR 2, LRR 4, LRR 6 and LRR 8 are indicated by shaded
boxes; and the LRR CT domain is indicated by a shaded box. Amino acids in bold in
LRR 8 indicate conserved glycosylation sites. A dot indicates a conserved cysteine
residue in LRR 4. The box at the C terminus indicates putative GPI signals.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides purified and isolated polynucleotides (e.g.,
DNA sequences and RNA transcripts, both sense and complementary antisense
strands, both single- and double-stranded, including splice variants thereof) encoding
NgR homologs, referred to herein as NgR. Unless indicated otherwise, as used herein,
the abbreviation in lower case (NgR) refers to a gene, cDNA, RNA or nucleic acid
sequence, whereas the upper case version (NgR) refers to a protein, polypeptide,
peptide, oligopeptide, or amino acid sequence. Specific proteins are designated by
number, e.g., "NgR2" is a human NgR homolog, "NgR3" is a murine-derived NgR
homolog, and "NgR1" is the known NgR identified by Dr. Stephen Strittmatter.
Known NgRs are herein referred to as "NgRs." DNA polynucleotides of the invention
include genomic DNA, cDNA and DNA that has been chemically synthesized in whole
or in part.

Standard reference works setting forth the general principles of recombinant
DNA technology known to those of skill in the art include Ausubel et al., CURRENT
PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York (1998);
Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2d Ed., Cold
Spring Harbor Laboratory Press, Plainview, New York (1989); Kaufman et al., Eds.,
HANDBOOK OF MOLECULAR AND CELLULAR METHODS IN BIOLOGY AND MEDICINE,
CRC Press, Boca Raton (1995); McPherson, Ed., DIRECTED MUTAGENESIS: A
PRACTICAL APPROACH, IRL Press, Oxford (1991).

As used herein, the term "axon" refers to a long cellular protrusion from a
neuron, whereby action potentials are conducted, either to or from the cell body.

As used herein, the term "axonal growth" refers to an extension of the long
process or axon, originating at the cell body and preceded by the growth cone.

As used herein, the term "central nervous system disorder" refers to any
pathological state associated with abnormal function of the central nervous system
(CNS). The term includes, but is not limited to, altered CNS function resulting from
physical trauma to cerebral tissue, viral infection, autoimmune mechanisms and genetic
mutation.

As used herein, the term "demyelinating disease" refers to a pathological
disorder characterized by the degradation of the myelin sheath of the oligodendrocyte
cell membrane.

As used herein, the term "growth cone" refers to a specialized region at the tip 
of a growing neurite that is responsible for sensing the local environment and moving 
the axon toward its appropriate synaptic target cell. 

As used herein, the term "growth cone movement" refers to the extension or
collapse of the growth cone toward a neuron's target cell.

As used herein, the term "neurite" refers to a process growing out of a neuron.
As it is sometimes difficult to distinguish a dendrite from an axon in culture, the term
"neurite" is used for both.

As used herein, the term "oligodendrocyte" refers to a neuroglial cell of the
CNS whose function is to myelinate CNS axons.

"Synthesized" as used herein and understood in the art, refers to
polynucleotides produced by purely chemical, as opposed to enzymatic, methods.
"Wholly" synthesized DNA sequences are therefore produced entirely by chemical
means, and "partially" synthesized DNAs embrace those wherein only portions of the
resulting DNA were produced by chemical means. By the term "region" is meant a
physically contiguous portion of the primary structure of a biomolecule. In the case of
proteins, a region is defined by a contiguous portion of the amino acid sequence of that
protein. The term "domain" is herein defined as referring to a structural part of a
biomolecule that contributes to a known or suspected function of the biomolecule.
Domains may be co-extensive with regions or portions thereof; domains may also
incorporate a portion of a biomolecule that is distinct from a particular region, in
addition to all or part of that region. Examples of NgR protein domains include, but
are not limited to, the signal peptide, extracellular (N-terminal) domain, and
leucine-rich repeat domains.

As used herein, the term "activity" refers to a variety of measurable indicia
suggesting or revealing binding, either direct or indirect; affecting a response, i.e.,
having a measurable effect in response to some exposure or stimulus, including, for
example, the affinity of a compound for directly binding a polypeptide or
polynucleotide of the invention, or, for example, measurement of amounts of upstream
or downstream proteins or other similar functions after some stimulus or event. Such
activities may be measured by assays such as competitive inhibition of NgR1 binding to
Nogo, wherein, for example, unlabeled, soluble NgR2 is added to an assay
system in increasing concentrations to inhibit the binding of Nogo to NgR1 expressed
on the surface of CHO cells. As another example, one may assess the ability of
neurons to extend across lesions caused by nerve damage (as in Schnell and Schwab
(1990) Nature 343, 269-272) following inhibition of Nogo by various forms of NgR2
and/or NgR3 as a biological indicator of NgR function.
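
By way of illustration only, the titration in such a competitive inhibition assay can be sketched numerically. The Python sketch below assumes a simple one-site competition model in which the fraction of Nogo still bound to cell-surface NgR1 falls as 1 / (1 + [competitor] / IC50). This model, the function name fraction_bound, and the IC50 and concentration values are assumptions chosen for illustration; none of them is taken from the present disclosure.

```python
def fraction_bound(competitor_conc, ic50):
    """Fraction of ligand still bound at a given competitor concentration.

    Assumes a one-site competition model (an illustrative assumption):
    the signal is 1.0 with no competitor and 0.5 when the competitor
    concentration equals the IC50. Both values use the same units.
    """
    if competitor_conc < 0:
        raise ValueError("concentration must be non-negative")
    if ic50 <= 0:
        raise ValueError("IC50 must be positive")
    return 1.0 / (1.0 + competitor_conc / ic50)


if __name__ == "__main__":
    hypothetical_ic50 = 10.0  # arbitrary units, for illustration only
    for conc in (0.0, 1.0, 10.0, 100.0):
        # Increasing competitor concentrations give a falling bound fraction.
        print(conc, round(fraction_bound(conc, hypothetical_ic50), 3))
```

In practice the IC50 would be estimated by fitting measured binding signals, and a real assay may not follow a one-site model.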

As used herein, the term "antibody" is meant to refer to complete, intact
antibodies, and Fab, Fab', F(ab)2, and other fragments thereof. Complete, intact
antibodies include monoclonal antibodies such as murine monoclonal antibodies,
chimeric antibodies, anti-idiotypic antibodies, anti-anti-idiotypic antibodies, and
humanized antibodies.

As used herein, the term "binding" means the physical or chemical interaction
between two proteins or compounds or associated proteins or compounds or
combinations thereof. Binding includes ionic, non-ionic, hydrogen bonds, Van der
Waals, hydrophobic interactions, etc. The physical interaction, the binding, can be
either direct or indirect, indirect being through or due to the effects of another protein
or compound. Direct binding refers to interactions that do not take place through or
due to the effect of another protein or compound but instead are without other
substantial chemical intermediates.

As used herein, the term "compound" means any identifiable chemical or
molecule, including, but not limited to, small molecules, peptides, proteins, sugars,
nucleotides or nucleic acids, and such compound can be natural or synthetic.

As used herein, the term "complementary" refers to Watson-Crick basepairing
between nucleotide units of a nucleic acid molecule.

As used herein, the term "contacting" means bringing together, either directly
or indirectly, a compound into physical proximity to a polypeptide or polynucleotide of
the invention. The polypeptide or polynucleotide can be in any number of buffers,
salts, solutions etc. Contacting includes, for example, placing the compound into a
beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or the
like, which contains the nucleic acid molecule, or polypeptide encoding the NgR or
fragment thereof.

As used herein, the phrase "homologous nucleotide sequence," or "homologous
amino acid sequence," or variations thereof, refers to sequences characterized by an
identity at the nucleotide level, or a homology at the amino acid level, of at least the
specified percentage. Homologous nucleotide sequences include those sequences
coding for isoforms of proteins. Such isoforms can be expressed in different tissues of
the same organism as a result of, for example, alternative splicing of RNA.
Alternatively, isoforms can be encoded by different genes. Homologous nucleotide
sequences include nucleotide sequences encoding for a protein of a species other than
humans, including, but not limited to, mammals. Homologous nucleotide sequences
also include, but are not limited to, naturally occurring allelic variations and mutations
of the nucleotide sequences set forth herein. A homologous nucleotide sequence does
not, however, include the nucleotide sequence encoding NgR1. Homologous amino
acid sequences include those amino acid sequences which contain conservative amino
acid substitutions and which polypeptides have the same binding and/or activity. A
homologous amino acid sequence does not, however, include the amino acid sequence
encoding other known NgRs. Percent homology can be determined by, for example,
the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix,
Genetics Computer Group, University Research Park, Madison WI), using the default
settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2,
482-489, which is incorporated herein by reference in its entirety).
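
The Gap program itself is not reproduced here. As a minimal sketch, assuming two sequences that have already been aligned (with "-" marking gap positions), a percent identity can be computed in Python as follows. The function name percent_identity, the gap character, and the choice to count every aligned column except gap-to-gap columns in the denominator are assumptions; other conventions exist and give different values, and the figure produced need not equal the value reported by Gap.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of compared alignment columns holding identical residues.

    Both inputs must be rows of the same pairwise alignment, so they have
    equal length. Columns where both rows hold a gap are ignored; a residue
    aligned against a gap counts as a mismatch (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap and res_b == gap:
            continue
        compared += 1
        if res_a == res_b:
            identical += 1
    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Hypothetical six-column alignment, for illustration only.
    print(percent_identity("ACDE-G", "ACDQFG"))
```

A threshold test such as "at least the specified percentage" then reduces to comparing this value with the chosen cutoff.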

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic
acid molecule (DNA or RNA) that is substantially free of nucleic acids encoding other
proteins with which it is associated in nature, i.e., a nucleic acid that has been removed
from its native environment. Examples of isolated nucleic acid molecules include, but
are not limited to, recombinant DNA molecules contained in a vector, recombinant
DNA molecules maintained in a heterologous host cell, partially or substantially
purified nucleic acid molecules, and synthetic DNA or RNA molecules. Preferably, an
"isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (i.e.,
sequences located at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the
organism from which the nucleic acid is derived. For example, in various
embodiments, the isolated NgR nucleic acid molecule can contain less than about 50
kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which
naturally flank the nucleic acid molecule in genomic DNA of the cell from which the
nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a
cDNA molecule, can be substantially free of other cellular material or culture medium
when produced by recombinant techniques, or of chemical precursors or other
chemicals when chemically synthesized.

As used herein, the term "heterologous" refers to a nucleotide or amino acid
sequence that is a different, or non-corresponding sequence, or a sequence derived
from a different species. For example, a mouse NgR nucleotide or amino acid sequence
is heterologous to a human NgR nucleotide or amino acid sequence, and a human NgR
nucleic or amino acid sequence is heterologous to a human immunoglobulin nucleotide
or amino acid sequence.

As used herein, a "soluble NgR polypeptide" is a NgR polypeptide that does
not anchor itself in a membrane. Such soluble polypeptides include, for example,
NgR2 and NgR3 polypeptides that lack a sufficient portion of their GPI anchor signal
to anchor the polypeptide or are modified such that the GPI anchor signal is not
adequate to result in replacement of the peptide with a GPI anchor. In preferred
embodiments, up to 5, 10, 20 or 25 amino acids are removed from the C-terminus of
NgR2 or NgR3 to make the respective proteins soluble. As used herein soluble NgR
polypeptides include full-length or truncated (e.g., with internal deletions) NgR.

Soluble NgR polypeptides may include the entire NgR protein up to the
putative GPI signal sequence (e.g., amino acid 1 to about amino acid 395 of NgR2,
and from amino acid 1 to about amino acid 438 of NgR3). In other embodiments, the
signal peptide of the proteins may be removed or truncated (e.g., all or part of the
signal sequence of NgR2, which spans amino acid 1 to about amino acid 30 of SEQ ID
NO: 2, may be removed; all or part of the signal sequence of NgR3, which spans amino
acid 1 to about amino acid 40 of SEQ ID NO: 4, may be removed). In some
embodiments, the mature NgR2 (SEQ ID NO: 8) and the mature NgR3 (SEQ ID
NO: 9) are used.
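
The residue ranges above are 1-based and inclusive, which differs from the 0-based slicing used in most programming languages. As a minimal sketch of how such truncations map onto a sequence string, the Python helpers below extract a residue range and drop a signal peptide and a C-terminal GPI signal. The function names residues and soluble_fragment are hypothetical, and the placeholder sequence is synthetic; it merely reuses the approximate NgR2 boundaries given in the text (signal sequence ending at about residue 30, GPI signal beginning after about residue 395).

```python
def residues(sequence, first, last):
    """Return residues first..last of a protein (1-based, inclusive)."""
    if not 1 <= first <= last <= len(sequence):
        raise ValueError("residue range lies outside the sequence")
    # Convert the 1-based inclusive range to a 0-based half-open slice.
    return sequence[first - 1:last]


def soluble_fragment(full_length, signal_end, gpi_start):
    """Drop residues 1..signal_end and everything from gpi_start onward."""
    return residues(full_length, signal_end + 1, gpi_start - 1)


if __name__ == "__main__":
    # Synthetic 420-residue placeholder, not a sequence of the disclosure:
    # 30 "signal" residues, 365 "body" residues, 25 "GPI signal" residues.
    placeholder = "M" * 30 + "A" * 365 + "G" * 25
    with_signal = residues(placeholder, 1, 395)
    mature = soluble_fragment(placeholder, signal_end=30, gpi_start=396)
    print(len(with_signal), len(mature))
```

Because the text gives the boundaries only approximately ("about amino acid 395"), the exact cut points would be chosen case by case.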

Soluble NgR polypeptides include at least one of the putative ligand-binding
portions of NgR, including the first cysteine-rich region (SEQ ID NO: 10), the leucine
repeat region (SEQ ID NO: 12) and the second cysteine-rich region (SEQ ID NO: 11).
In some embodiments, soluble NgR polypeptides consist of amino acid 1 through
about amino acid 395 of SEQ ID NO: 2, or amino acid 1 through about amino acid
438 of SEQ ID NO: 4.

In other embodiments, the soluble NgR polypeptides are fusion proteins that
contain amino acid 30 through about amino acid 395 of mature NgR2 or amino acid
40 through about amino acid 438 of NgR3, the C-terminal 10 amino acids of a human
IgG1 hinge region containing the two cysteine residues thought to participate in
interchain disulfide bonding, and the CH2 and CH3 regions of a human IgG1 heavy
chain constant domain. This type of recombinant protein is designed to modulate
inhibition of axonal elongation through inhibition of the Nogo ligand binding to NgR1,
or by inhibiting the ligand of the NgR from interacting with cell surface NgR. The
NgR portion of the fusion binds to the Nogo ligand and the IgG1 portion binds to the
FcγRI (macrophage) and FcγIII (NK cells and neutrophils) receptors.

The production of the soluble polypeptides useful in this invention may be
achieved by a variety of methods known in the art. For example, the polypeptides may
be derived from intact transmembrane NgR molecules by proteolysis using specific
endopeptidases in combination with exopeptidases, Edman degradation, or both. The
intact NgR molecule, in turn, may be purified from its natural source using
conventional methods. Alternatively, the intact NgR may be produced by known
recombinant DNA techniques using cDNAs, expression vectors and well-known
techniques for recombinant gene expression.

Preferably, the soluble polypeptides useful in the present invention are
produced directly, thus eliminating the need for an entire NgR as a starting material.
This may be achieved by conventional chemical synthesis techniques or by well-known
recombinant DNA techniques wherein only those DNA sequences which encode the
desired peptides are expressed in transformed hosts. For example, a gene which
encodes the desired soluble NgR polypeptide may be synthesized by chemical means
using an oligonucleotide synthesizer. Such oligonucleotides are designed based on the
amino acid sequence of the desired soluble NgR polypeptide. Specific DNA sequences
coding for the desired peptide also can be derived from the full-length DNA sequence
by isolation of specific restriction endonuclease fragments or by PCR synthesis of the
specified region from cDNA.

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule
having the nucleotide sequence of SEQ ID NOs: 1, 3 or a complement of either of
these nucleotide sequences, can be isolated using standard molecular biology
techniques and the sequence information provided herein. Using all or a portion of the
nucleic acid sequences of SEQ ID NOs: 1 or 3 as a hybridization probe, NgR nucleic
acid sequences can be isolated using standard hybridization and cloning techniques
(e.g., as described in Sambrook et al., eds., MOLECULAR CLONING: A LABORATORY
MANUAL 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY,
1989; and Ausubel et al., eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John
Wiley & Sons, New York, NY, 1993).

A nucleic acid of the invention can be amplified using cDNA, mRNA or
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers
according to standard PCR amplification techniques. The nucleic acid so amplified can
be cloned into an appropriate vector and characterized by DNA sequence analysis.



WO 02/29059 PCT/US01/31488

Furthermore, oligonucleotides corresponding to NgR nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the terms "modulates" or "modifies" mean an increase or
decrease in the amount, quality, or effect of a particular activity or protein.

As used herein, the term "oligonucleotide" refers to a series of linked
nucleotide residues which has a sufficient number of bases to be used in a polymerase
chain reaction (PCR). This short sequence is based on (or designed from) a genomic
or cDNA sequence and is used to amplify, confirm or reveal the presence of an
identical, similar or complementary DNA or RNA in a particular cell or tissue.
Oligonucleotides comprise portions of a DNA sequence having at least about 10
nucleotides and as many as about 50 nucleotides, preferably about 15 to 30
nucleotides. They are chemically synthesized and may be used as probes.
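
The length ranges in the oligonucleotide definition can be illustrated with a short
sketch. The function name and the hard cut-offs are assumptions made for illustration
only; the definition above says "about", so the boundaries are approximate in practice.

```python
def classify_oligo_length(sequence: str) -> str:
    """Classify a candidate oligonucleotide by length alone.

    Cut-offs follow the ranges in the definition above (about 10 to about 50
    nucleotides, preferably about 15 to 30); treating them as exact limits is
    an assumption of this sketch.
    """
    length = len(sequence)
    if 15 <= length <= 30:
        return "preferred"
    if 10 <= length <= 50:
        return "acceptable"
    return "outside range"


if __name__ == "__main__":
    for candidate in ("ACGT" * 5, "ACGT" * 3, "ACGT" * 15):
        print(len(candidate), classify_oligo_length(candidate))
```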

As used herein, the term "probe" refers to nucleic acid sequences of variable
length, preferably between at least about 10 and as many as about 6,000 nucleotides,
depending on use. They are used in the detection of identical, similar or
complementary nucleic acid sequences. Longer length probes are usually obtained from
a natural or recombinant source, are highly specific and much slower to hybridize than
oligomers. They may be single- or double-stranded and carefully designed to have
specificity in PCR, hybridization membrane-based, or ELISA-like technologies.

The term "preventing" refers to decreasing the probability that an organism
contracts or develops an abnormal condition.

The term "treating" refers to having a therapeutic effect and at least partially
alleviating or abrogating an abnormal condition in the organism.

The term "therapeutic effect" refers to the inhibition or activation of factors
causing or contributing to the abnormal condition. A therapeutic effect relieves to
some extent one or more of the symptoms of the abnormal condition. In reference to
the treatment of abnormal conditions, a therapeutic effect can refer to one or more of
the following: (a) an increase in the proliferation, growth, and/or differentiation of
cells; (b) inhibition (i.e., slowing or stopping) of cell death; (c) inhibition of
degeneration; (d) relieving to some extent one or more of the symptoms associated
with the abnormal condition; and (e) enhancing the function of the affected population
of cells. Compounds demonstrating efficacy against abnormal conditions can be
identified as described herein.

The term "abnormal condition" refers to a function in the cells or tissues of an
organism that deviates from their normal functions in that organism. An abnormal
condition can relate to cell proliferation, cell differentiation, cell signaling, or cell
survival. An abnormal condition may also include obesity, diabetic complications such
as retinal degeneration, and irregularities in glucose uptake and metabolism, and fatty
acid uptake and metabolism.

Abnormal cell proliferative conditions, for example, include cancers such as
fibrotic and mesangial disorders, abnormal angiogenesis and vasculogenesis, wound
healing, psoriasis, diabetes mellitus and inflammation.

Abnormal differentiation conditions include, for example, neurodegenerative 
disorders, slow wound healing rates and slow tissue grafting healing rates. 

Abnormal cell signaling conditions include, for example, psychiatric disorders
involving excess neurotransmitter activity.

Abnormal cell survival conditions may also relate to conditions in which
programmed cell death (apoptosis) pathways are activated or abrogated. A number of
protein kinases are associated with the apoptosis pathways. Aberrations in the
function of any one of the protein kinases could lead to cell immortality or premature
cell death.

The term "administering" relates to a method of incorporating a compound into
cells or tissues of an organism. The abnormal condition can be prevented or treated
when the cells or tissues of the organism exist within the organism or outside of the
organism. Cells existing outside the organism can be maintained or grown in cell
culture dishes. For cells harbored within the organism, many techniques exist in the art
to administer compounds, including (but not limited to) oral, parenteral, dermal,
injection, and aerosol applications. For cells outside of the organism, multiple
techniques exist in the art to administer the compounds, including (but not limited to)
cell microinjection techniques, transformation techniques and carrier techniques.

The abnormal condition can also be prevented or treated by administering a
compound to a group of cells having an aberration in a signal transduction pathway to
an organism. The effect of administering a compound on organism function can then
be monitored. The organism is preferably a mouse, rat, rabbit, guinea pig or goat,
more preferably a monkey or ape, and most preferably a human.

By "amplification" it is meant increased numbers of DNA or RNA in a cell
compared with normal cells. "Amplification" as it refers to RNA can be the detectable
presence of RNA in cells, since in some normal cells there is no basal expression of
RNA. In other normal cells, a basal level of expression exists, therefore in these cases
amplification is the detection of at least 1-2-fold, and preferably more, compared to
the basal level.

The amino acid sequences are presented in the amino to carboxy direction,
from left to right. The amino and carboxy groups are not presented in the sequence.
The nucleotide sequences are presented by single strand only, in the 5' to 3' direction,
from left to right. Nucleotides and amino acids are represented in the manner
recommended by the IUPAC-IUB Biochemical Nomenclature Commission or (for
amino acids) by three letters code.


Nucleic Acids 

Genomic DNA of the invention comprises the protein-coding region for a
polypeptide of the invention and is also intended to include allelic variants thereof. It is
widely understood that, for many genes, genomic DNA is transcribed into RNA
transcripts that undergo one or more splicing events wherein introns (i.e., non-coding
regions) of the transcripts are removed, or "spliced out." RNA transcripts that can be
spliced by alternative mechanisms, and therefore be subject to removal of different
RNA sequences but still encode a NgR polypeptide, are referred to in the art as splice
variants which are embraced by the invention. Splice variants comprehended by the
invention therefore are encoded by the same original genomic DNA sequences but
arise from distinct mRNA transcripts. Allelic variants are modified forms of a
wild-type gene sequence, the modification resulting from recombination during
chromosomal segregation or exposure to conditions which give rise to genetic
mutation. Allelic variants, like wild-type genes, are naturally occurring sequences (as
opposed to non-naturally occurring variants arising from in vitro manipulation).
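
The relationship between one primary transcript and several splice variants can be
sketched in a few lines. The toy transcript, the intron coordinates and the function
name below are hypothetical and are not taken from any SEQ ID NO; the sketch only
shows how removing different RNA segments from the same transcript yields distinct
mature sequences.

```python
def splice(pre_mrna: str, introns: list[tuple[int, int]]) -> str:
    """Remove introns given as 0-based, half-open (start, end) intervals."""
    exons = []
    position = 0
    for start, end in sorted(introns):
        # Intervals must be non-empty, in range, and non-overlapping.
        if not (position <= start < end <= len(pre_mrna)):
            raise ValueError(f"invalid or overlapping intron: {(start, end)}")
        exons.append(pre_mrna[position:start])
        position = end
    exons.append(pre_mrna[position:])
    return "".join(exons)


# Toy transcript: exons in upper case, introns in lower case (illustrative only).
PRE_MRNA = "AUGGCC" + "guaagu" + "UUUAAA" + "gucuag" + "UGA"

if __name__ == "__main__":
    # Variant 1: both introns removed, all three exons retained.
    print(splice(PRE_MRNA, [(6, 12), (18, 24)]))
    # Variant 2: the middle exon is removed together with its flanking introns.
    print(splice(PRE_MRNA, [(6, 24)]))
```

Both results derive from the same starting sequence, which mirrors the statement that
splice variants are encoded by the same genomic DNA but arise from distinct mRNA
transcripts.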

The invention also comprehends cDNA that is obtained through reverse
transcription of an RNA polynucleotide encoding NgR (conventionally followed by
second-strand synthesis of a complementary strand to provide a double-stranded
DNA).

Preferred DNA sequences encoding a human NgR polypeptide are set out in
SEQ ID NOs: 1 and 13. A preferred DNA of the invention comprises a double
stranded molecule comprising the coding molecule (i.e., the "coding strand") along
with the complementary molecule (the "non-coding strand" or "complement") having a
sequence unambiguously deducible from the coding strand according to Watson-Crick
base-pairing rules for DNA. Also preferred are other polynucleotides encoding NgR
polypeptides, as shown in SEQ ID NO:3, which comprises murine NgR homolog,
NgR3.
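
As a minimal sketch of how the non-coding strand follows from the coding strand under
Watson-Crick base pairing (A with T, G with C), the complement can be derived
mechanically. The function name and the example sequence are illustrative assumptions
and not part of the sequence listing; the result is written 5' to 3', consistent with
the presentation convention stated above.

```python
# Watson-Crick pairing for DNA: A<->T, C<->G (case preserved).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(coding_strand: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return coding_strand.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    coding = "ATGGCC"  # illustrative sequence only
    print(coding, reverse_complement(coding))
```

Applying the function twice returns the original strand, which is one way to see that
the complement is unambiguously deducible.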

Also preferred are nucleotide sequences that encode at least a portion of a NgR
polypeptide that has at least one biological function of a NgR. More preferred are
nucleotide sequences that encode a portion of NgR that encodes at least the mature
NgR without the hydrophobic C-terminal GPI signal. Also preferred are nucleotide
sequences that encode the portion of NgR that encodes at least the ligand-binding
region of NgR.

The invention further embraces other species, preferably mammalian, homologs
of the human NgR DNA. Species homologs, sometimes referred to as "orthologs," in
general, share at least 35%, at least 40%, at least 45%, at least 50%, at least 60%, at
least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least
95%, at least 98%, or at least 99% homology with human DNA of the invention.
Generally, percent sequence "homology" with respect to polynucleotides of the
invention may be calculated as the percentage of nucleotide bases in the candidate
sequence that are identical to nucleotides in the NgR sequences set forth in SEQ ID
NOs: 1, 3 or 13, after aligning the sequences and introducing gaps, if necessary, to
achieve the maximum percent sequence identity.
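
The percent "homology" calculation described above can be sketched as follows. The
sketch assumes the two sequences have already been aligned (with "-" marking
introduced gaps) by some alignment program; it does not perform the alignment itself.
The function name and the example sequences are hypothetical. Following the wording
above, the denominator is the number of bases in the candidate sequence.

```python
def percent_identity(aligned_candidate: str, aligned_reference: str, gap: str = "-") -> float:
    """Percentage of candidate bases identical to the aligned reference base.

    Both inputs must be the same length, i.e. already aligned with gaps.
    Gap positions in the candidate are not counted as candidate bases.
    """
    if len(aligned_candidate) != len(aligned_reference):
        raise ValueError("aligned sequences must have equal length")
    candidate_bases = 0
    identical = 0
    for cand, ref in zip(aligned_candidate.upper(), aligned_reference.upper()):
        if cand == gap:
            continue
        candidate_bases += 1
        if cand == ref:
            identical += 1
    if candidate_bases == 0:
        raise ValueError("candidate contains no bases")
    return 100.0 * identical / candidate_bases


if __name__ == "__main__":
    # 8 candidate bases, 7 of them identical to the reference: 87.5 percent.
    print(percent_identity("ATG-CCGTA", "ATGACCGAA"))
```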

The polynucleotide sequence information provided by the invention makes
possible large-scale expression of the encoded polypeptide by techniques well known
and routinely practiced in the art. Polynucleotides of the invention also permit
identification and isolation of polynucleotides encoding related NgR polypeptides, such
as human allelic variants and species homologs, by well-known techniques including
Southern and/or Northern hybridization, and polymerase chain reaction (PCR).
Examples of related polynucleotides include human and non-human genomic
sequences, including allelic variants, as well as polynucleotides encoding polypeptides
homologous to NgR and structurally related polypeptides sharing one or more
biological, immunological, and/or physical properties of NgR. Non-human species
genes encoding proteins homologous to NgR can also be identified by Southern and/or
PCR analysis and are useful in animal models for NgR disorders. Knowledge of the
sequence of a human NgR DNA also makes possible through use of Southern
hybridization or polymerase chain reaction (PCR) the identification of genomic DNA
sequences encoding NgR expression control regulatory sequences such as promoters,
operators, enhancers, repressors, and the like. Polynucleotides of the invention are
also useful in hybridization assays to detect the capacity of cells to express NgR.
Polynucleotides of the invention may also provide a basis for diagnostic methods useful
for identifying a genetic alteration(s) in a NgR locus that underlies a disease state or
states, which information is useful both for diagnosis and for selection of therapeutic
strategies.

The disclosure herein of a full-length polynucleotide encoding a NgR
polypeptide makes readily available to the worker of ordinary skill in the art every
possible fragment of the full-length polynucleotide. The invention, therefore, provides
fragments of NgR-encoding polynucleotides comprising at least 6, and preferably at
least 14, 16, 18, 20, 25, 50, or 75 consecutive nucleotides of a polynucleotide
encoding NgR. Preferably, fragments of polynucleotides of the invention comprise
sequences unique to the NgR-encoding polynucleotide sequence, and therefore
hybridize under highly stringent or moderately stringent conditions only (i.e.,
"specifically") to polynucleotides encoding NgR (or fragments thereof).
Polynucleotide fragments of genomic sequences of the invention comprise not only
sequences unique to the coding region, but also include fragments of the full-length
sequence derived from introns, regulatory regions, and/or other non-translated
sequences. Sequences unique to polynucleotides of the invention are recognizable
through sequence comparison to other known polynucleotides, and can be identified
through use of alignment programs routinely utilized in the art, e.g., those made
available in public sequence databases. Such sequences also are recognizable from
Southern hybridization analyses to determine the number of fragments of genomic
DNA to which a polynucleotide will hybridize. Polynucleotides of the invention can be
labeled in a manner that permits their detection, including radioactive, fluorescent and
enzymatic labeling.
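
The idea of fragments of a given minimum length that are unique to one polynucleotide
can be sketched by exact comparison against other known sequences. This is a
simplified illustration under stated assumptions: the sequences and function names are
hypothetical, only exact matches on the given strand are considered, and real
uniqueness screening would normally rely on alignment programs and database searches
rather than exact substring matching.

```python
def fragments(sequence: str, length: int) -> set[str]:
    """All fragments of `length` consecutive nucleotides in `sequence`."""
    return {sequence[i:i + length] for i in range(len(sequence) - length + 1)}


def unique_fragments(target: str, others: list[str], length: int = 14) -> set[str]:
    """Fragments of `target` that occur in none of the `others` (exact match)."""
    candidates = fragments(target, length)
    for other in others:
        candidates -= fragments(other, length)
    return candidates


if __name__ == "__main__":
    target = "ACGTACGTTT"      # illustrative sequences only
    known = ["ACGTACG"]
    print(sorted(unique_fragments(target, known, length=6)))
```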

Fragments of polynucleotides are particularly useful as probes for detection of
full-length NgR polynucleotides or fragments thereof. One or more polynucleotides can be
included in kits that are used to detect the presence of a polynucleotide encoding NgR,
or used to detect variations in a polynucleotide sequence encoding NgR.

The invention also embraces DNAs encoding NgR polypeptides that hybridize
under moderately stringent or high stringency conditions to the noncoding strand, or
complement, of the polynucleotide in any of SEQ ID NOs: 1 or 3.

Stringent conditions are known to those skilled in the art and can be found in
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, N.Y. (1989),
6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%,
70%, 75%, 85%, 90%, 95%, 98% or 99% homologous to each other typically remain
hybridized to each other. A non-limiting example of stringent hybridization conditions
is hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA and 500 mg/ml denatured salmon
sperm DNA at 65°C. This hybridization is followed by one or more washes in 0.2X
SSC, 0.01% BSA at 50°C. An isolated nucleic acid molecule of the invention that
hybridizes under stringent conditions to the sequence of SEQ ID NOs: 1 or 3
corresponds to a naturally occurring nucleic acid molecule. As used herein, a
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having
a nucleotide sequence that occurs in nature (e.g., encodes a natural protein).

As used herein, "stringent hybridization conditions" means: 42°C in a hybridization
solution comprising 50% formamide, 1% SDS, 1 M NaCl, 10% (wt/vol) dextran
sulfate, and washing twice for 30 minutes at 60°C in a wash solution comprising 0.1X
SSC and 1% SDS.
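
The "X SSC" figures in the conditions above are fold-concentrations of a stock buffer.
As a small worked example, and assuming the conventional definition of 1X SSC as
0.15 M sodium chloride and 0.015 M sodium citrate (a definition not stated in this
specification), the salt molarity at each step can be computed as below. The function
and constant names are illustrative.

```python
# Assumed conventional composition of 1X SSC, in mol/L (not stated in the text).
SSC_1X_MOLAR = {"NaCl": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold: float) -> dict[str, float]:
    """Molar concentration of each SSC component at the given fold strength."""
    return {solute: round(conc * fold, 6) for solute, conc in SSC_1X_MOLAR.items()}


if __name__ == "__main__":
    # Fold strengths named in the hybridization and wash conditions above.
    for fold in (6, 0.2, 0.1):
        print(f"{fold}X SSC:", ssc_molarity(fold))
```

The calculation makes explicit that the washes (0.2X or 0.1X SSC) are carried out at a
much lower salt concentration than the 6X SSC hybridization step.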

Vectors 

Another aspect of the present invention is directed to vectors, or recombinant
expression vectors, comprising any of the nucleic acid molecules described above.
Vectors are used herein either to amplify DNA or RNA encoding NgR and/or to
express DNA which encodes NgR. As used herein, the term "vector" refers to a
nucleic acid molecule capable of transporting another nucleic acid to which it has been
linked. One type of vector is a "plasmid", which refers to a circular double stranded
DNA loop into which additional DNA segments can be ligated. Another type of
vector is a viral vector, wherein additional DNA segments can be ligated into the viral
genome. Certain vectors are capable of autonomous replication in a host cell into
which they are introduced (e.g., bacterial vectors having a bacterial origin of
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal
mammalian vectors) are integrated into the genome of a host cell upon introduction
into the host cell, and thereby are replicated along with the host genome. Moreover,
certain vectors are capable of directing the expression of genes to which they are
operatively linked. Such vectors are referred to herein as "expression vectors". In
general, expression vectors of utility in recombinant DNA techniques are often in the
form of plasmids. In the present specification, "plasmid" and "vector" can be used
interchangeably as the plasmid is the most commonly used form of vector. However,
the invention is intended to include such other forms of expression vectors, such as
viral vectors (e.g., replication defective retroviruses, adenoviruses and
adeno-associated viruses), that serve equivalent functions.

Expression of proteins in prokaryotes is most often carried out in E. coli with
vectors containing constitutive or inducible promoters directing the expression of
either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a
protein encoded therein, usually to the amino terminus of the recombinant protein.
Such fusion vectors typically serve three purposes: (1) to increase expression of
recombinant protein; (2) to increase the solubility of the recombinant protein; and (3)
to aid in the purification of the recombinant protein by acting as a ligand in affinity
purification. Often, in fusion expression vectors, a proteolytic cleavage site is
introduced at the junction of the fusion moiety and the recombinant protein to enable
separation of the recombinant protein from the fusion moiety subsequent to
purification of the fusion protein. Such enzymes, and their cognate recognition
sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression
vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67,
31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia,
Piscataway, N.J.) that fuse glutathione-S-transferase (GST), maltose E binding protein,
or protein A, respectively, to the target recombinant protein.

Examples of suitable inducible non-fusion E. coli expression vectors include
pTrc (Amann et al., (1988) Gene 69, 301-315) and pET 11d (Studier et al., GENE
EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San
Diego, CA. (1990) 60-89).

One strategy to maximize recombinant protein expression in E. coli is to
express the protein in host bacteria with an impaired capacity to proteolytically cleave
the recombinant protein. See, Gottesman, GENE EXPRESSION TECHNOLOGY:
METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, CA. (1990) 119-128.
Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted
into an expression vector so that the individual codons for each amino acid are those
preferentially utilized in E. coli (Wada et al., (1992) Nucleic Acids Res. 20,
2111-2118). Such alteration of nucleic acid sequences of the invention can be carried out by
standard DNA synthesis techniques.
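
The codon-replacement strategy can be sketched as a synonymous recoding step: each
codon is translated and, where a preferred codon is listed for that amino acid, swapped
for it, so the encoded protein is unchanged. The preference table below is a
hypothetical, partial placeholder for illustration and is not taken from Wada et al.;
an actual design would use a complete, measured codon-usage table for the chosen host.
The input sequence is likewise illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Hypothetical partial preference table (amino acid -> codon), illustration only.
PREFERRED_CODON = {"L": "CTG", "A": "GCG", "K": "AAA", "E": "GAA"}


def translate(coding_dna: str) -> str:
    """Translate an in-frame coding sequence ('*' marks a stop codon)."""
    dna = coding_dna.upper()
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(coding_dna: str, preferred: dict[str, str] = PREFERRED_CODON) -> str:
    """Swap each codon for the preferred synonymous codon, where one is listed."""
    dna = coding_dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    recoded = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TO_AA[codon]
        # Codons for amino acids without a listed preference are left unchanged.
        recoded.append(preferred.get(amino_acid, codon))
    return "".join(recoded)


if __name__ == "__main__":
    original = "ATGCTAGCAAAGGAGTAG"
    optimized = recode(original)
    print(original, translate(original))
    print(optimized, translate(optimized))
```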

In another embodiment, the NgR expression vector is a yeast expression
vector. Examples of vectors for expression in yeast S. cerevisiae include pYepSec1
(Baldari et al., (1987) EMBO J. 6, 229-234), pMFa (Kurjan and Herskowitz (1982)
Cell 30, 933-943), pJRY88 (Schultz et al., (1987) Gene 54, 113-123), pYES2
(Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego,
CA).

Alternatively, NgR can be expressed in insect cells using baculovirus expression
vectors. Baculovirus vectors available for expression of proteins in cultured insect
cells (e.g., SF9 cells) include the pAc series (Smith et al., (1983) Mol. Cell. Biol. 3,
2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170, 31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in
mammalian cells using a mammalian expression vector. Examples of mammalian
expression vectors include pCDM8 (Seed (1987) Nature 329, 840) and pMT2PC
(Kaufman et al. (1987) EMBO J. 6, 187-195). When used in mammalian cells, the
expression vector's control functions are often provided by viral regulatory elements.
For example, commonly used promoters are derived from polyoma, adenovirus 2,
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both
prokaryotic and eukaryotic cells, see, e.g., Chapters 16 and 17 of Sambrook et al.
(Eds.), Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989.

In another embodiment, the recombinant mammalian expression vector is
capable of directing expression of the nucleic acid preferentially in a particular cell type
(e.g., tissue-specific regulatory elements are used to express the nucleic acid).
Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.
(1987) Genes Dev. 1, 268-277), lymphoid-specific promoters (Calame and Eaton
(1988) Adv. Immunol. 43, 235-275), in particular promoters of T cell receptors
(Winoto and Baltimore (1989) EMBO J. 8, 729-733) and immunoglobulins (Banerji et
al. (1983) Cell 33, 729-740; Queen and Baltimore (1983) Cell 33, 741-748),
neuron-specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc.
Natl. Acad. Sci. USA 86, 5473-5477), pancreas-specific promoters (Edlund et al.
(1985) Science 230, 912-916), and mammary gland-specific promoters (e.g., milk
whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No.
264,166). Developmentally-regulated promoters are also encompassed, e.g., the
murine hox promoters (Kessel and Gruss (1990) Science 249, 374-379) and the
α-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3, 537-546).

The invention further provides a recombinant expression vector comprising a
DNA molecule of the invention cloned into the expression vector in an antisense
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence
in a manner that allows for expression (by transcription of the DNA molecule) of an
RNA molecule that is antisense to NgR mRNA. Regulatory sequences operatively linked
to a nucleic acid cloned in the antisense orientation can be chosen that direct the
continuous expression of the antisense RNA molecule in a variety of cell types, for
instance viral promoters and/or enhancers, or regulatory sequences can be chosen that
direct constitutive, tissue-specific or cell-type-specific expression of antisense RNA.
The antisense expression vector can be in the form of a recombinant plasmid,
phagemid or attenuated virus in which antisense nucleic acids are produced under the
control of a high efficiency regulatory region, the activity of which can be determined
by the cell type into which the vector is introduced. For a discussion of the regulation
of gene expression using antisense genes see Weintraub et al., "Antisense RNA as a
molecular tool for genetic analysis," REVIEWS--TRENDS IN GENETICS, Vol. 1(1) 1986.

Preferred vectors include, but are not limited to, plasmids, phages, cosmids,
episomes, viral particles or viruses and integratable DNA fragments (i.e., fragments
integratable into the host genome by homologous recombination). Preferred viral
particles include, but are not limited to, adenoviruses, baculoviruses, parvoviruses,
herpesviruses, poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia
viruses and retroviruses. Preferred expression vectors include, but are not limited to,
pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors
include, but are not limited to, pSPORT™ vectors, pGEM™ vectors (Promega),
pPROEX™ vectors (LTI, Bethesda, MD), Bluescript™ vectors (Stratagene), pQE™
vectors (Qiagen), pSE420™ (Invitrogen) and pYES2™ (Invitrogen).

Preferred expression vectors are replicable DNA constructs in which a DNA
sequence encoding NgR is operably linked or connected to suitable control sequences
capable of effecting the expression of the NgR in a suitable host. DNA regions are
operably linked or connected when they are functionally related to each other. For
example, a promoter is operably linked or connected to a coding sequence if it controls
the transcription of the sequence. Amplification vectors do not require expression
control domains, but rather need only the ability to replicate in a host, usually
conferred by an origin of replication, and a selection gene to facilitate recognition of
transformants. The need for control sequences in the expression vector will vary
depending upon the host selected and the transformation method chosen. Generally,
control sequences include, but are not limited to, a transcriptional promoter, enhancers,
an optional operator sequence to control transcription, polyadenylation signals, a
sequence encoding suitable mRNA ribosomal binding and sequences which control the
termination of transcription and translation. Such regulatory sequences are described,
for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN
ENZYMOLOGY 185, Academic Press, San Diego, CA (1990). Regulatory sequences
include those that direct constitutive expression of a nucleotide sequence in many types
of host cell and those that direct expression of the nucleotide sequence only in certain
host cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those
skilled in the art that the design of the expression vector can depend on such factors as
the choice of the host cell to be transformed, the level of expression of protein desired,
etc. The expression vectors of the invention can be introduced into host cells to
thereby produce proteins or peptides, including fusion proteins or peptides, encoded by
nucleic acids as described herein (e.g., NgR proteins, mutant forms of NgR, fusion
proteins, etc.).

Preferred vectors preferably contain a promoter that is recognized by the host
organism. The promoter sequences of the present invention may be prokaryotic,
eukaryotic or viral. Examples of suitable prokaryotic sequences include the PR and PL
promoters of bacteriophage lambda (THE BACTERIOPHAGE LAMBDA, Hershey, A.D.
(Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1973), which is
incorporated herein by reference in its entirety; LAMBDA II, Hendrix, R.W. (Ed.), Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1980), which is
incorporated herein by reference in its entirety); the trp, recA, heat shock, and lacZ
promoters of E. coli and the SV40 early promoter (Benoist et al., (1981) Nature 290,
304-310, which is incorporated herein by reference in its entirety). Additional
promoters include, but are not limited to, mouse mammary tumor virus, long terminal
repeat of human immunodeficiency virus, Moloney virus, cytomegalovirus immediate
early promoter, Epstein Barr virus, Rous sarcoma virus, human actin, human myosin,
human hemoglobin, human muscle creatine and human metallothionein.

Additional regulatory sequences can also be included in preferred vectors.
Preferred examples of suitable regulatory sequences are represented by the
Shine-Dalgarno sequence of the replicase gene of the phage MS-2 and of the gene cII
of bacteriophage lambda. The Shine-Dalgarno sequence may be directly followed by
DNA encoding NgR and result in the expression of the mature NgR protein.

Moreover, suitable expression vectors can include an appropriate marker that
allows the screening of the transformed host cells. The transformation of the selected
host is carried out using any one of the various techniques well known to the expert in
the art and described in Sambrook et al., supra.

An origin of replication can also be provided either by construction of the
vector to include an exogenous origin or by the host cell
chromosomal replication mechanism. If the vector is integrated into the host cell
chromosome, the latter may be sufficient. Alternatively, rather than using vectors
which contain viral origins of replication, one skilled in the art can transform
mammalian cells by the method of co-transformation with a selectable marker and NgR
DNA. An example of a suitable marker is dihydrofolate reductase (DHFR) or
thymidine kinase (see, U.S. Patent No. 4,399,216).

Nucleotide sequences encoding NgR may be recombined with vector DNA in
accordance with conventional techniques, including blunt-ended or staggered-ended
termini for ligation, restriction enzyme digestion to provide appropriate termini, filling
in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable
joining and ligation with appropriate ligases. Techniques for such manipulation are
disclosed by Sambrook et al., supra and are well known in the art. Methods for
construction of mammalian expression vectors are disclosed in, for example, Okayama
et al., (1983) Mol. Cell. Biol. 3:280, Cosman et al. (1986) Mol. Immunol. 23:935,
Cosman et al., (1984) Nature 312:768, EP-A-0367566, and WO 91/18982, each of
which is incorporated herein by reference in its entirety.


Host Cells and Transformed Host Cells 

According to another aspect of the invention, host cells are provided, including
prokaryotic and eukaryotic cells, comprising a polynucleotide of the invention (or
vector of the invention) in a manner that permits expression of the encoded NgR
polypeptide. Preferably, the cell produces little or no endogenous NgR polypeptide.
Polynucleotides of the invention may be introduced into the host cell as part of a
circular plasmid, or as linear DNA comprising an isolated protein coding region or a
viral vector. Methods for introducing DNA into the host cell that are well known and
routinely practiced in the art include transformation, transfection, electroporation,
nuclear injection, or fusion with carriers such as liposomes, micelles, ghost cells and
protoplasts. Expression systems of the invention include bacterial, yeast, fungal, plant,
insect, invertebrate, vertebrate and mammalian cell systems.

Host cells of the invention are a valuable source of immunogen for
development of antibodies specifically immunoreactive with NgR. Host cells of the
invention are also useful in methods for the large-scale production of NgR
polypeptides wherein the cells are grown in a suitable culture medium and the desired
polypeptide products are isolated from the cells, or from the medium in which the cells
are grown, by purification methods known in the art, e.g., conventional
chromatographic methods including immunoaffinity chromatography, receptor affinity
chromatography, hydrophobic interaction chromatography, lectin affinity
chromatography, size exclusion filtration, cation or anion exchange chromatography,
high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still
other methods of purification include those methods wherein the desired protein is
expressed and purified as a fusion protein having a specific tag, label or chelating
moiety that is recognized by a specific binding partner or agent. The purified protein
can be cleaved to yield the desired protein, or can be left as an intact fusion protein.
Cleavage of the fusion component may produce a form of the desired protein having
additional amino acid residues as a result of the cleavage process.

Knowledge of NgR DNA sequences allows for modification of cells to permit,
or increase, expression of endogenous NgR. Cells can be modified (e.g., by
homologous recombination) to provide increased expression by replacing, in whole or
in part, the naturally occurring NgR promoter with all or part of a heterologous
promoter so that the cells express NgR at higher levels. The heterologous promoter is
inserted in such a manner that it is operatively linked to endogenous NgR encoding
sequences. (See, for example, PCT International Publication No. WO 94/12650, PCT
International Publication No. WO 92/20808, and PCT International Publication No.
WO 91/09955.) It is also contemplated that, in addition to heterologous promoter
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene
which encodes carbamoyl phosphate synthase, aspartate transcarbamylase, and
dihydroorotase) and/or intron DNA may be inserted along with the heterologous
promoter DNA. If linked to the NgR coding sequence, amplification of the marker
DNA by standard selection methods results in co-amplification of the NgR coding
sequences in the cells.

The DNA sequence information provided by the present invention also makes
possible the development (e.g., by homologous recombination or "knock-out"
strategies; see Capecchi, Science 244:1288-1292 (1989)) of animals that fail to express
functional NgR or that express a variant of NgR. Such animals (especially small
laboratory animals such as rats, rabbits and mice) are useful as models for studying the
in vivo activities of NgR and modulators of NgR.

Suitable host cells for expression of the polypeptides of the invention include,
but are not limited to, prokaryotes, yeast, and eukaryotes. If a prokaryotic expression
vector is employed, then the appropriate host cell would be any prokaryotic cell
capable of expressing the cloned sequences. Suitable prokaryotic cells include, but are
not limited to, bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas,
Streptomyces and Staphylococcus.

If a eukaryotic expression vector is employed, then the appropriate host cell
would be any eukaryotic cell capable of expressing the cloned sequence. Preferably,
eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but
are not limited to, non-human mammalian tissue culture cells and human tissue culture
cells. Preferred host cells include, but are not limited to, insect cells, HeLa cells,
Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS
cells), human 293 cells, and murine 3T3 fibroblasts. Propagation of such cells in cell
culture has become a routine procedure (see, Tissue Culture, Academic Press, Kruse
and Patterson, Eds. (1973), which is incorporated herein by reference in its entirety).

In addition, a yeast cell may be employed as a host cell. Preferred yeast cells
include, but are not limited to, the genera Saccharomyces, Pichia and Kluyveromyces.
Preferred yeast hosts are S. cerevisiae and P. pastoris. Preferred yeast vectors can
contain an origin of replication sequence from a 2µ yeast plasmid, an autonomously
replicating sequence (ARS), a promoter region, sequences for polyadenylation,
sequences for transcription termination and a selectable marker gene. Shuttle vectors
for replication in both yeast and E. coli are also included herein.

Alternatively, insect cells may be used as host cells. In a preferred
embodiment, the polypeptides of the invention are expressed using a baculovirus
expression system (see, Luckow et al., Bio/Technology, 1988, 6, 47; Baculovirus
Expression Vectors: A Laboratory Manual, O'Reilly et al. (Eds.), W.H.
Freeman and Company, New York, 1992; and U.S. Patent No. 4,879,236, each of
which is incorporated herein by reference in its entirety). In addition, the MAXBAC™
complete baculovirus expression system (Invitrogen) can, for example, be used for
production in insect cells.

Suitable host cells are discussed further in Goeddel, Gene Expression
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA
(1990). Alternatively, the recombinant expression vector can be transcribed and
translated in vitro, for example using T7 promoter regulatory sequences and T7
polymerase.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of art-recognized
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated
transfection, lipofection, or electroporation. Suitable methods for transforming or
transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A
Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon
the expression vector and transfection technique used, only a small fraction of cells
may integrate the foreign DNA into their genome. In order to identify and select these
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is
generally introduced into the host cells along with the gene of interest. Various
selectable markers include those that confer resistance to drugs, such as G418,
hygromycin, dihydrofolate reductase (DHFR) and methotrexate. Nucleic acid
encoding a selectable marker can be introduced into a host cell on the same vector as
that encoding NgR or can be introduced on a separate vector. Cells stably transfected
with the introduced nucleic acid can be identified by drug selection (e.g., cells that
have incorporated the selectable marker gene will survive, while the other cells die).

In a preferred embodiment, the polypeptides of the invention, including forms
of NgR2 and NgR3, soluble forms of NgR, chimeric NgR polypeptides, NgR/Ig
fusions and fragments and variations of each of the above are expressed in Chinese
Hamster Ovary (CHO) cells.

In order to introduce the DNA fragment coding for the NgR protein or
polypeptide into the CHO cell to express the recombinant NgR protein or polypeptide,
it is necessary to construct the expression vector.

The vectors for CHO expression include, but are not limited to, pA1-11, pXT1,
pRc/CMV, pRc/RSV and pcDNAI/Neo. The promoter is not specifically limited
provided it effectively promotes expression in CHO cells. Examples of suitable
promoters are: SRα, SV40, LTR, CMV, and HSV-TK. Of these, CMV and SRα
promoters are preferred.

In addition to the above-mentioned promoters, the expression vectors may
contain enhancers, splicing signals, polyadenylation signals, selectable markers and an
SV40 replication origin. Suitable selectable markers include, but are not limited to, the
dihydrofolate reductase (DHFR) gene which provides resistance to methotrexate
(MTX), the ampicillin resistance gene, and the neomycin resistance gene.

Examples of the expression vectors each containing the DNA coding for NgR,
portions, fragments and soluble constructs thereof, include the vector (such as one
described above), into which the promoter is operably linked (preferably upstream) to
the nucleotide sequence encoding the desired NgR construct; a polyadenylation signal
downstream from the nucleotide sequence encoding the NgR construct; and,
preferably, the vector includes an operable DHFR gene. Preferably, the ampicillin
resistant gene is also operably contained in the vector.

CHO cells lacking the DHFR gene (Urlaub, G. et al., (1980) Proc. Natl. Acad.
Sci. USA 77, 4216-4220) and CHO-K1 (Proc. Natl. Acad. Sci. USA 60, 1275 (1968))
are suitable for use.

The NgR expression vectors prepared as above are introduced into CHO cells
by any known method, including, but not limited to, the calcium phosphate method
(Graham and van der Eb (1973) Virol. 52, 456-467) and electroporation (Neumann et
al., (1982) EMBO J. 1, 841-845).

Transformants carrying the expression vectors are selected based on the
above-mentioned selectable markers. Repeated clonal selection of the transformants
using the selectable markers allows selection of stable cell lines having high expression
of the NgR constructs. Increased MTX concentrations in the selection medium allow
gene amplification and greater expression of the desired protein. The CHO cell
containing the recombinant NgR can be produced by cultivating the CHO cells
containing the NgR expression vectors constitutively expressing the NgR constructs.

Media used in cultivating CHO cells include DMEM medium supplemented
with about 0.5 to 20% fetal calf serum, DMEM medium and RPMI1640 medium. The
pH of the medium is preferably about 6 to 8. Cultivation is preferably at about 30 to
40°C for about 15 to 72 hours with aeration.

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in
culture, can be used to produce (i.e., express) NgR protein. Accordingly, the
invention further provides methods for producing NgR protein using the host cells of
the invention. In one embodiment, the method comprises culturing the host cell of the
invention (into which a recombinant expression vector encoding NgR has been
introduced) in a suitable medium such that NgR protein is produced. In another
embodiment, the method further comprises isolating NgR from the medium or the host
cell.

In situations where the NgR polypeptide will be found primarily intracellularly,
intracellular material (including inclusion bodies for Gram-negative bacteria) can be
extracted from the host cell using any standard technique known to one of ordinary
skill in the art. Such methods would encompass, by way of example and not by way of
limitation, lysing the host cells to release the contents of the periplasm/cytoplasm by
French press, homogenization and/or sonication followed by centrifugation.

If the NgR polypeptide has formed inclusion bodies in the cytosol, such
inclusion bodies may frequently bind to the inner and/or outer cellular membranes.
Upon centrifugation, the inclusion bodies will be found primarily in the pellet material.
The pellet material can then be treated at pH extremes or with one or more chaotropic
agents such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives
in the presence of a reducing agent such as dithiothreitol at alkaline pH or
tris-carboxyethyl phosphine at acid pH to release, break apart and solubilize the
inclusion bodies. Once solubilized, NgR polypeptide can be analyzed using gel
electrophoresis, immunoprecipitation or the like. Various methods of isolating the
NgR polypeptide would be apparent to one of ordinary skill in the art; for example,
isolation may be accomplished using standard methods such as those set forth below
and in Marston et al. (1990) Meth. Enzymol. 182, 264-275 (incorporated by reference
herein in its entirety).

If isolated NgR polypeptide is not biologically active following the isolation
procedure employed, various methods for "refolding" or converting the polypeptide to
its tertiary structure and generating disulfide linkages can be used to restore biological
activity. Methods known to one of ordinary skill in the art include adjusting the pH of
the solubilized polypeptide to a pH usually above 7 and in the presence of a particular
concentration of a chaotrope. The selection of chaotrope is very similar to the choices
used for inclusion body solubilization but usually at a lower concentration and is not
necessarily the same chaotrope as used for the solubilization. It may be required to
employ a reducing agent or the reducing agent plus its oxidized form in a specific ratio,
to generate a particular redox potential allowing for disulfide shuffling to occur in the
formation of the protein's cysteine bridge(s). Some of the commonly used redox
couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride,
dithiothreitol (DTT)/dithiane DTT, 2-mercaptoethanol (bME)/dithio-b(ME). To
increase the efficiency of the refolding, it may be necessary to employ a cosolvent, such
as glycerol, polyethylene glycol of various molecular weights and arginine.

Transgenic Animals 

The host cells of the invention can also be used to produce non-human
transgenic animals. For example, in one embodiment, a host cell of the invention is a
fertilized oocyte or an embryonic stem cell into which NgR-coding sequences have
been introduced. Such host cells can then be used to create non-human transgenic
animals in which exogenous NgR sequences have been introduced into their genome or
homologous recombinant animals in which endogenous NgR sequences have been
altered. Such animals are useful for studying the function and/or activity of NgR and
for identifying and/or evaluating modulators of NgR activity. As used herein, a
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a
rodent such as a rat or mouse, in which one or more of the cells of the animal includes
a transgene. Other examples of transgenic animals include non-human primates, sheep,
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that is
integrated into the genome of a cell from which a transgenic animal develops and that
remains in the genome of the mature animal, thereby directing the expression of an
encoded gene product in one or more cell types or tissues of the transgenic animal. As
used herein, a "homologous recombinant animal" is a non-human animal, preferably a
mammal, more preferably a mouse, in which an endogenous NgR gene has been altered
by homologous recombination between the endogenous gene and an exogenous DNA
molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal,
prior to development of the animal.

A transgenic animal of the invention can be created by introducing
NgR-encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by
microinjection or retroviral infection, and allowing the oocyte to develop in a
pseudopregnant female foster animal. The human NgR DNA sequence of SEQ ID
NOs: 1 or 3 can be introduced as a transgene into the genome of a non-human animal.
Alternatively, a nonhuman homolog of the human NgR gene, such as a mouse NgR
gene, can be isolated based on hybridization to the human NgR cDNA (described
further above) and used as a transgene. Intronic sequences and polyadenylation signals
can also be included in the transgene to increase the efficiency of expression of the
transgene. A tissue-specific regulatory sequence(s) can be operably linked to the NgR
transgene to direct expression of NgR protein to particular cells. Methods for
generating transgenic animals via embryo manipulation and microinjection, particularly
animals such as mice, have become conventional in the art and are described, for
example, in U.S. Pat. Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan 1986, in
Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY. Similar methods are used for production of other transgenic
animals. A transgenic founder animal can be identified based upon the presence of the
NgR transgene in its genome and/or expression of NgR mRNA in tissues or cells of the
animals. A transgenic founder animal can then be used to breed additional animals
carrying the transgene. Moreover, transgenic animals carrying a transgene encoding
NgR can further be bred to other transgenic animals carrying other transgenes.

To create a homologous recombinant animal, a vector is prepared which
contains at least a portion of a NgR gene into which a deletion, addition or substitution
has been introduced to thereby alter, e.g., functionally disrupt, the NgR gene. The
NgR gene can be a human gene (SEQ ID NOs: 1 or 13), but more preferably, is a
non-human homolog of a human NgR gene. For example, a mouse homolog of human
NgR gene of SEQ ID NOs: 1 or 13 can be used to construct a homologous
recombination vector suitable for altering an endogenous NgR gene in the mouse
genome. In one embodiment, the vector is designed such that, upon homologous
recombination, the endogenous NgR gene is functionally disrupted (i.e., no longer
encodes a functional protein; also referred to as a "knock out" vector).

Alternatively, the vector can be designed such that, upon homologous
recombination, the endogenous NgR gene is mutated or otherwise altered but still
encodes functional protein (e.g., the upstream regulatory region can be altered to
thereby alter the expression of the endogenous NgR protein). In the homologous
recombination vector, the altered portion of the NgR gene is flanked at its 5' and 3'
ends by additional nucleic acid of the NgR gene to allow for homologous
recombination to occur between the exogenous NgR gene carried by the vector and an
endogenous NgR gene in an embryonic stem cell. The additional flanking NgR nucleic
acid is of sufficient length for successful homologous recombination with the
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5' and 3'
ends) are included in the vector. See e.g., Thomas et al. (1987) Cell 51:503 for a
description of homologous recombination vectors. The vector is introduced into an
embryonic stem cell line (e.g., by electroporation) and cells in which the introduced
NgR gene has homologously recombined with the endogenous NgR gene are selected
(see e.g., Li et al. (1992) Cell 69:915).

The selected cells are then injected into a blastocyst of an animal (e.g., a
mouse) to form aggregation chimeras. See e.g., Bradley 1987, In:
Teratocarcinomas and Embryonic Stem Cells: A Practical Approach,
Robertson, ed. IRL, Oxford, pp. 113-152. A chimeric embryo can then be implanted
into a suitable pseudopregnant female foster animal and the embryo brought to term.
Progeny harboring the homologously recombined DNA in their germ cells can be used
to breed animals in which all cells of the animal contain the homologously recombined
DNA by germline transmission of the transgene. Methods for constructing
homologous recombination vectors and homologous recombinant animals are
described further in Bradley (1991) Curr. Opin. Biotechnol. 2:823-829; PCT
International Publication Nos.: WO 90/11354; WO 91/01140; WO 92/0968; and WO
93/04169.

In another embodiment, transgenic non-human animals can be produced that
contain selected systems that allow for regulated expression of the transgene. One
example of such a system is the cre/loxP recombinase system of bacteriophage P1. For
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc.
Natl. Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991)
Science 251:1351-1355). If a cre/loxP recombinase system is used to regulate
expression of the transgene, animals containing transgenes encoding both the Cre
recombinase and a selected protein are required. Such animals can be provided
through the construction of "double" transgenic animals, e.g., by mating two
transgenic animals, one containing a transgene encoding a selected protein and the
other containing a transgene encoding a recombinase.
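
As a rough planning aid for the "double" transgenic cross described above, the
following Python sketch enumerates gamete combinations to estimate the
expected fraction of offspring carrying both transgenes. It is illustrative only
and rests on assumptions not stated in the specification: each transgene is a
single-locus insertion, the two insertions segregate independently, and the
allele labels ("cre", "target") and the function name are hypothetical.

```python
from fractions import Fraction
from itertools import product


def double_transgenic_fraction(parent_a_alleles, parent_b_alleles):
    """Expected fraction of offspring inheriting a transgene from BOTH parents.

    Each parent is described by the two alleles at its transgene locus;
    None marks a chromosome that lacks the transgene.
    """
    # Every gamete from parent A can pair with every gamete from parent B.
    outcomes = list(product(parent_a_alleles, parent_b_alleles))
    hits = sum(1 for a, b in outcomes if a is not None and b is not None)
    return Fraction(hits, len(outcomes))


# Hypothetical hemizygous founders: one copy of the transgene each.
hemizygous_cre = ("cre", None)
hemizygous_target = ("target", None)
# A parent bred to homozygosity for the recombinase transgene.
homozygous_cre = ("cre", "cre")

print(double_transgenic_fraction(hemizygous_cre, hemizygous_target))
print(double_transgenic_fraction(homozygous_cre, hemizygous_target))
```

Under these assumptions, about one quarter of the offspring of two hemizygous
parents carry both transgenes, rising to one half if one parent is homozygous.
Observed litters will vary, and offspring still need to be genotyped.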

Clones of the non-human transgenic animals described herein can also be
produced according to the methods described in Wilmut et al. (1997) Nature
385:810-813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be
isolated and induced to exit the growth cycle and enter G0 phase. The quiescent cell
can then be fused, e.g., through the use of electrical pulses, to an enucleated oocyte
from an animal of the same species from which the quiescent cell is isolated. The
reconstructed oocyte is then cultured such that it develops to morula or blastocyst and
then transferred to a pseudopregnant female foster animal. The offspring borne of this
female foster animal will be a clone of the animal from which the cell, e.g., the somatic
cell, is isolated.

Antisense 

Also provided by the invention are antisense polynucleotides that recognize and
hybridize to NgR polynucleotides. Full-length and fragment antisense polynucleotides
are provided. Fragment antisense molecules of the invention include (i) those that
specifically recognize and hybridize to NgR RNA (as determined by sequence
comparison of DNA encoding NgR to DNA encoding other known molecules).
Identification of sequences unique to NgR encoding polynucleotides can be deduced
through use of any publicly available sequence database, and/or through use of
commercially available sequence comparison programs. After identification of the
desired sequences, isolation through restriction digestion or amplification using any of
the various polymerase chain reaction techniques well known in the art can be
performed. Antisense polynucleotides are particularly relevant to regulating
expression of NgR by those cells expressing NgR mRNA.
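
The sequence-comparison step described above can be sketched in Python as
follows. This is a toy illustration, not the method of the invention or a
substitute for a database search tool. It lists fixed-length windows of a target
sequence that do not occur, on either strand, in a set of comparison sequences.
The function names, the window length and the short example sequences are
hypothetical and are not taken from the sequence listing.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def kmers(seq, k):
    """Return the set of all length-k windows in seq."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def unique_windows(target, others, k):
    """List (offset, window) pairs of target absent from every other sequence.

    Both strands of each comparison sequence are considered, because a probe
    could hybridize to either one.
    """
    background = set()
    for other in others:
        background |= kmers(other, k)
        background |= kmers(reverse_complement(other), k)
    target = target.upper()
    return [
        (i, target[i:i + k])
        for i in range(len(target) - k + 1)
        if target[i:i + k] not in background
    ]


# Toy sequences chosen only to make the behavior easy to follow.
print(unique_windows("AAAACCCC", ["AAAAC"], k=4))
```

Exact-match windows are a deliberately strict criterion. A practical search
would also tolerate mismatches and would be run against a full sequence
database rather than a short list.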

Antisense oligonucleotides, or fragments of a nucleotide sequence set forth in
SEQ ID NO:1, 3, 13 or sequences complementary or homologous thereto, derived
from the nucleotide sequences of the present invention encoding NgR are useful as
diagnostic tools for probing gene expression in various tissues. For example, tissue
can be probed in situ with oligonucleotide probes carrying detectable groups by
conventional autoradiography techniques to investigate native expression of this
receptor or pathological conditions relating thereto. In specific aspects, antisense
nucleic acid molecules are provided that comprise a sequence complementary to at
least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire NgR coding strand, or
to only a portion thereof. Nucleic acid molecules encoding fragments, homologs,
derivatives and analogs of a NgR protein of SEQ ID NO:2, 4 or 14 or antisense
nucleic acids complementary to a NgR nucleic acid sequence of SEQ ID NOs: 1, 3 or
13 are additionally provided.

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding
region" of the coding strand of a nucleotide sequence encoding NgR. The term
"coding region" refers to the region of the nucleotide sequence comprising codons
which are translated into amino acid residues (e.g., the protein coding region of human
NgR corresponds to the coding region SEQ ID NO: 1, 3 or 13). In another
embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region"
of the coding strand of a nucleotide sequence encoding NgR. The term "noncoding
region" refers to 5' and 3' sequences which flank the coding region that are not
translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions).

Antisense oligonucleotides are preferably directed to regulatory regions of a
nucleotide sequence of SEQ ID NO:1, 3, 13 or mRNA corresponding thereto,
including, but not limited to, the initiation codon, TATA box, enhancer sequences, and
the like. Given the coding strand sequences encoding NgR disclosed herein (e.g., SEQ
ID NO:1, 3 or 13), antisense nucleic acids of the invention can be designed according
to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic
acid molecule can be complementary to the entire coding region of NgR mRNA, but
more preferably is an oligonucleotide that is antisense to only a portion of the coding
or noncoding region of NgR mRNA. For example, the antisense oligonucleotide can
be complementary to the region surrounding the translation start site of NgR mRNA.
An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40,
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be
constructed using chemical synthesis or enzymatic ligation reactions using procedures
known in the art. For example, an antisense nucleic acid (e.g., an antisense
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or
variously modified nucleotides designed to increase the biological stability of the
molecules or to increase the physical stability of the duplex formed between the
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine
substituted nucleotides can be used.
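
The Watson-Crick design rule described above, applied to the region surrounding
the translation start site, can be sketched in Python as follows. The sketch is
illustrative only and is not the design procedure of the invention. The function
names, the window parameters and the short example transcript are hypothetical;
the example is not an NgR sequence. The sketch returns an unmodified DNA oligo
and does not model backbone chemistry, secondary structure or off-target
binding.

```python
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence."""
    return dna.upper().translate(DNA_COMPLEMENT)[::-1]


def antisense_oligo(mrna, start_codon_index, upstream, length):
    """Design a DNA oligo antisense to a window around the start codon.

    The window begins `upstream` nucleotides 5' of the start codon and
    spans `length` nucleotides of the transcript. The oligo is returned
    5'->3', so it pairs antiparallel with the mRNA window.
    """
    # Work in the DNA alphabet so the oligo is written as DNA.
    sense = mrna.upper().replace("U", "T")
    begin = max(0, start_codon_index - upstream)
    window = sense[begin:begin + length]
    return reverse_complement(window)


# Hypothetical transcript fragment, used only to show the mechanics.
example_mrna = "GGCACCAUGAAGAGGGCGUCC"
start = example_mrna.find("AUG")
print(antisense_oligo(example_mrna, start, upstream=3, length=9))
```

The `length` argument can be set to any of the oligonucleotide lengths
mentioned above (for example 15, 20 or 25); a longer window generally trades
ease of synthesis for specificity.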

Examples of modified nucleotides that can be used to generate the antisense
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil,
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil,
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil,
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine,
1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine,
2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine,
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil,
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine,
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil,
4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic
acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced
biologically using an expression vector into which a nucleic acid has been subcloned in
an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of
an antisense orientation to a target nucleic acid of interest, described further in the
following subsection).

The antisense nucleic acid molecules of the invention (preferably
oligonucleotides of 10 to 20 nucleotides in length) are typically administered to a
subject or generated in situ such that they hybridize with or bind to cellular mRNA
and/or genomic DNA encoding a NgR protein to thereby inhibit expression of the
protein, e.g., by inhibiting transcription and/or translation. Suppression of NgR
expression at either the transcriptional or translational level is useful to generate
cellular or animal models for diseases/conditions characterized by aberrant NgR
expression. The hybridization can be by conventional nucleotide complementarity to
form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule
that binds to DNA duplexes, through specific interactions in the major groove of the
double helix.

Phosphorothioate and methylphosphonate antisense oligonucleotides are
specifically contemplated for therapeutic use by the invention. The antisense
oligonucleotides may be further modified by adding poly-L-lysine, transferrin
polylysine or cholesterol moieties at their 5' end.

An example of a route of administration of antisense nucleic acid molecules of
the invention includes direct injection at a tissue site. Alternatively, antisense nucleic
acid molecules can be modified to target selected cells and then administered
systemically. For example, for systemic administration, antisense molecules can be
modified such that they specifically bind to receptors or antigens expressed on a
selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or
antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid
molecules can also be delivered to cells using the vectors described herein. To achieve
sufficient intracellular concentrations of antisense molecules, vector constructs in
which the antisense nucleic acid molecule is placed under the control of a strong pol II
or pol III promoter are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention
is an α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms
specific double-stranded hybrids with complementary RNA in which, contrary to the
usual β-units, the strands run parallel to each other (Gaultier et al., (1987) Nucleic
Acids Res. 15, 6625-6641). The antisense nucleic acid molecule can also comprise a
2'-o-methylribonucleotide (Inoue et al., (1987) Nucleic Acids Res. 15, 6131-6148) or a
chimeric RNA-DNA analogue (Inoue et al., (1987) FEBS Lett. 215, 327-330).

The NgR sequences taught in the present invention facilitate the design of
novel transcription factors for modulating NgR expression in native cells and animals,
and cells transformed or transfected with NgR polynucleotides. For example, the
Cys2-His2 zinc finger proteins, which bind DNA via their zinc finger domains, have
been shown to be amenable to structural changes that lead to the recognition of
different target sequences. These artificial zinc finger proteins recognize specific target
sites with high affinity and low dissociation constants, and are able to act as gene
switches to modulate gene expression. Knowledge of the particular NgR target
sequence of the present invention facilitates the engineering of zinc finger proteins
specific for the target sequence using known methods such as a combination of
structure-based modeling and screening of phage display libraries (Segal et al., (1999)
Proc. Natl. Acad. Sci. USA 96, 2758-2763; Liu et al., (1997) Proc. Natl. Acad. Sci.
USA 94, 5525-5530; Greisman et al. (1997) Science 275, 657-661; Choo et al., (1997)
J. Mol. Biol. 273, 525-532). Each zinc finger domain usually recognizes three or more
base pairs. Since a recognition sequence of 18 base pairs is generally sufficient in
length to render it unique in any known genome, a zinc finger protein consisting of 6
tandem repeats of zinc fingers would be expected to ensure specificity for a particular
sequence (Segal et al., (1999), above). The artificial zinc finger repeats, designed
based on the promoter of NgR sequences, are fused to activation or repression
domains to promote or suppress NgR expression (Liu et al., (1997), above). The
promoter of NgR may be obtained by standard methods known to one of ordinary skill
in the art with the disclosure contained herein and knowledge of the NgR sequence.
Alternatively, the zinc finger domains can be fused to the TATA box-binding factor
(TBP) with varying lengths of linker region between the zinc finger peptide and the
TBP to create either transcriptional activators or repressors (Kim et al., (1997) Proc.
Natl. Acad. Sci. USA 94, 3616-3620). Such proteins, and polynucleotides that encode
them, have utility for modulating NgR expression in vivo in both native cells, animals
and humans; and/or cells transfected with NgR-encoding sequences. The novel
transcription factor can be delivered to the target cells by transfecting constructs that
express the transcription factor (gene therapy), or by introducing the protein.
Engineered zinc finger proteins can also be designed to bind RNA sequences for use in
therapeutics as alternatives to antisense or catalytic RNA methods (McColl et al.,
(1997) Proc. Natl. Acad. Sci. USA 96, 9521-9526; Wu et al., (1995) Proc. Natl.
Acad. Sci. USA 92, 344-348). The present invention contemplates methods of
designing such transcription factors based on the gene sequence of the invention, as
well as customized zinc finger proteins, that are useful to modulate NgR expression in
cells (native or transformed) whose genetic complement includes these sequences.
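
The arithmetic behind the 18 base pair figure can be checked with the short
Python sketch below. It is a back-of-the-envelope estimate under assumptions
that are not part of the specification: the genome is treated as a random
sequence with equal base frequencies, both strands are counted, and a haploid
genome size of about 3 x 10^9 base pairs is used as a placeholder. Real genomes
contain repeats and biased composition, so the result is only indicative.

```python
def site_length_bp(fingers, bp_per_finger=3):
    """Length of the DNA site read by a tandem array of zinc fingers."""
    return fingers * bp_per_finger


def distinct_sites(length_bp):
    """Number of different DNA sequences of the given length."""
    return 4 ** length_bp


def expected_random_occurrences(length_bp, genome_size_bp):
    """Expected chance matches of one site in a random genome, both strands."""
    return 2 * genome_size_bp / distinct_sites(length_bp)


ASSUMED_GENOME_SIZE_BP = 3_000_000_000  # placeholder value, see lead-in

six_finger_site = site_length_bp(6)  # 6 fingers x 3 bp = 18 bp
print(six_finger_site, distinct_sites(six_finger_site))
print(expected_random_occurrences(six_finger_site, ASSUMED_GENOME_SIZE_BP))
print(expected_random_occurrences(site_length_bp(3), ASSUMED_GENOME_SIZE_BP))
```

With six fingers the expected number of chance matches falls below one,
whereas a three-finger (9 bp) site would be expected to recur many thousands of
times under the same assumptions. This is consistent with the rationale given
above for using six tandem fingers.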

Ribozymes and PNA moieties

In still another embodiment, an antisense nucleic acid of the invention is a
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes,
described in Haselhoff and Gerlach (1988) Nature 334, 585-591) can be used to
catalytically cleave NgR mRNA transcripts to thereby inhibit translation of NgR
mRNA. A ribozyme having specificity for a NgR-encoding nucleic acid can be
designed based upon the nucleotide sequence of a NgR DNA disclosed herein (i.e.,
SEQ ID NOs:1, 3 or 13). For example, a derivative of a Tetrahymena L-19 IVS RNA
can be constructed in which the nucleotide sequence of the active site is
complementary to the nucleotide sequence to be cleaved in a NgR-encoding mRNA.
See, e.g., Cech et al. U.S. Patent No. 4,987,071; and Cech et al. U.S. Patent No.
5,116,742. Alternatively, NgR mRNA can be used to select a catalytic RNA having a
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al.,
(1993) Science 261, 1411-1418.

Alternatively, NgR gene ^ression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory re^on of the NgR (e.g., the NgR 
promoter and/or enhancers) to form triple helical structures that prevent transcription 
ofthe NgR gene in target cells. See generaUy,Hdene (1991) i4>itfcawccri>rMgZ)e5. 6: 

25 569-584; Helene. et al, (1992) Arm. N.Y. Acad Set 660:27-36; and Maher (1992) 
BioEssqys 14, iOl'ilS. 

In various embodiments, the nucleic acids of NgR can be modified at the base
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability,
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate
backbone of the nucleic acids can be modified to generate peptide nucleic acids (see
Hyrup et al., (1996) Bioorg. Med. Chem. Lett. 4, 5-23). As used herein, the terms
"peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in
which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone
and only the four natural nucleobases are retained. The neutral backbone of PNAs has
been shown to allow for specific hybridization to DNA and RNA under conditions of
low ionic strength. The synthesis of PNA oligomers can be performed using standard
solid phase peptide synthesis protocols as described in Hyrup et al., (1996) above;
Perry-O'Keefe et al., (1996) Proc. Natl. Acad. Sci. USA 93, 14670-14675.

PNAs of NgR can be used in therapeutic and diagnostic applications. For
example, PNAs can be used as antisense or antigene agents for sequence-specific
modulation of gene expression by, e.g., inducing transcription or translation arrest or
inhibiting replication. PNAs of NgR can also be used, e.g., in the analysis of single
base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial
restriction enzymes when used in combination with other enzymes, e.g., S1 nucleases
(Hyrup (1996), above); or as probes or primers for DNA sequence and hybridization
(Hyrup et al., (1996), above; Perry-O'Keefe (1996), above).

In another embodiment, PNAs of NgR can be modified, to enhance their
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques
of drug delivery known in the art. For example, PNA-DNA chimeras of NgR can be
generated that may combine the advantageous properties of PNA and DNA. Such
chimeras allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to
interact with the DNA portion while the PNA portion would provide high binding
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate
lengths selected in terms of base stacking, number of bonds between the nucleobases,
and orientation (Hyrup (1996), above). The synthesis of PNA-DNA chimeras can be
performed as described in Hyrup (1996), above and Finn et al. (1996) Nucleic Acids
Res. 24, 3357-3363. For example, a DNA chain can be synthesized on a solid support
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs,
e.g., 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used
between the PNA and the 5' end of DNA (Mag et al. (1989) Nucleic Acids Res. 17,
973-988). PNA monomers are then coupled in a stepwise manner to produce a
chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. (1996),
above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment



wo 02/29059 



PCT/USOl/31488 



-41- 

and a 3' PNA segment. See, Petersen et al. (1975) Bioorg. Med. Chem. Lett. 5:1119-
1124.

In other embodiments, the oligonucleotide may include other appended groups
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating
transport across the cell membrane (see Letsinger et al., (1989) Proc. Natl. Acad. Sci.
USA 86, 6553-6556; Lemaitre et al., (1987) Proc. Natl. Acad. Sci. USA 84, 648-652;
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with
hybridization-triggered cleavage agents (see, e.g., Krol et al., (1988) Biotechniques 6,
958-976) or intercalating agents (see, e.g., Zon (1988) Pharm. Res. 5, 539-549). To
this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a
hybridization-triggered cross-linking agent, a transport agent, a hybridization-triggered
cleavage agent, etc.

Automated sequencing methods can be used to obtain or verify the nucleotide
sequence of NgR. The NgR nucleotide sequences of the present invention are believed
to be 100% accurate. However, as is known in the art, nucleotide sequence obtained
by automated methods may contain some errors. Nucleotide sequences determined by
automation are typically at least about 90%, more typically at least about 95% to at
least about 99.9% identical to the actual nucleotide sequence of a given nucleic acid
molecule. The actual sequence may be more precisely determined using manual
sequencing methods, which are well known in the art. An error in a sequence which
results in an insertion or deletion of one or more nucleotides may result in a frame shift
in translation such that the predicted amino acid sequence will differ from that which
would be predicted from the actual nucleotide sequence of the nucleic acid molecule,
starting at the point of the mutation.
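
The effect of a single-base sequencing error on the predicted protein can be illustrated with a short sketch. The sequences below are hypothetical toy examples, not NgR sequences, and the helper name translate is an assumption made for illustration; the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical "actual" sequence and a read with one spurious G inserted after base 3.
actual = "ATGGCTGAAACC"
with_insertion = actual[:3] + "G" + actual[3:]

print(translate(actual))          # MAET
print(translate(with_insertion))  # MG (frame shifted; premature stop at TGA)
```

Every residue downstream of the insertion changes, which is why an insertion or deletion error matters more for the predicted amino acid sequence than a simple base substitution.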

Polypeptides 

The invention also provides purified and isolated mammalian NgR polypeptides
encoded by a polynucleotide of the invention. Presently preferred is a human NgR
polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID
NO:14. Another preferred embodiment is a mouse NgR polypeptide comprising the
amino acid sequence of NgR3, as set forth in SEQ ID NO:4.

One aspect of the invention pertains to isolated NgR proteins, and biologically
active portions thereof or derivatives, fragments, analogs or homologs thereof. Also
provided are polypeptide fragments suitable for use as immunogens to raise anti-NgR
antibodies. Preferably, fragments of NgR proteins comprise at least one biological
activity of NgR. In one embodiment, native NgR proteins can be isolated from cells or
tissue sources by an appropriate purification scheme using standard protein purification
techniques. In another embodiment, NgR proteins are produced by recombinant DNA
techniques. Alternative to recombinant expression, a NgR protein or polypeptide can
be synthesized chemically using standard peptide synthesis techniques.

The invention also embraces polypeptides that have at least 99%, at least 95%,
at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at
least 60%, at least 55%, at least 50% or at least 45% identity and/or homology to the
preferred polypeptide of the invention. In addition, the invention embraces
polypeptides having the consensus sequence shown in SEQ ID NO:6 (shown in Table
5), excluding the previously characterized NgR ("NgR1"), and polypeptides comprising
at least about 90% of the consensus sequence.

The term "percentage of sequence identity" is calculated by comparing two
optimally aligned sequences over that region of comparison, determining the number of
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case
of nucleic acids) occurs in both sequences to yield the number of matched positions,
dividing the number of matched positions by the total number of positions in the region
of comparison (i.e., the window size), and multiplying the result by 100 to yield the
percentage of sequence identity. The term "substantial identity" as used herein denotes
a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a
sequence that has at least 80 percent sequence identity, preferably at least 85 percent
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent
sequence identity as compared to a reference sequence over a comparison region.
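
The calculation just described (matched positions divided by window size, times 100) can be sketched as follows. This is a minimal illustration that assumes the two sequences have already been optimally aligned and padded to equal length with "-" for gaps; the function name and the handling of gap columns (counted in the window, never as matches) are assumptions, not definitions taken from the text.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of sequence identity over a pre-aligned comparison window."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("comparison window is empty")
    # Matched positions: identical symbols, ignoring gap-versus-gap columns.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    # Window size: total number of positions in the region of comparison.
    return 100.0 * matches / len(aligned_a)


# Hypothetical 10-base window with one mismatch: 9 matches / 10 positions.
print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))  # 90.0
```

Under the definition of "substantial identity" above, this toy pair (90 percent) would clear the 80 and 85 percent thresholds but not the 99 percent one.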

In one aspect, percent homology is calculated as the percentage of amino acid
residues in the smaller of two sequences which align with identical amino acid residue
in the sequence being compared, when four gaps in a length of 100 amino acids may be
introduced to maximize alignment (Dayhoff, in ATLAS OF PROTEIN SEQUENCE AND
STRUCTURE, Vol. 5, p. 124, National Biochemical Research Foundation, Washington,
D.C. (1972), incorporated herein by reference).

A determination of homology or identity is typically made by a computer
homology program known in the art. An exemplary program is the Gap program
(Wisconsin Sequence Analysis Package, Version 8 for UNIX, Genetics Computer
Group, University Research Park, Madison, WI) using the default settings, which uses
the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 482-489, which is
incorporated herein by reference in its entirety). Employing the GAP software
provided in the GCG program package (see Needleman and Wunsch (1970) J. Mol.
Biol. 48, 443-453), the following settings for nucleic acid sequence comparison may be
used: GAP creation penalty of 5.0 and GAP extension penalty of 0.3. The coding
region of the analogous nucleic acid sequences referred to above exhibits a degree of
identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with
the CDS (encoding) part of the DNA sequence shown in SEQ ID NOs:1, 3 or 13.
BestFit was originally written for Version 1.0 by Paul Haeberli from a careful reading
of the papers by Needleman and Wunsch (1970), above, and Smith and Waterman
(1981), above. The following BestFit settings for nucleic acid sequence comparison
may be used: GAP creation penalty of 8.0 and GAP extension penalty of 2. The coding
region of the analogous nucleic acid sequences referred to above exhibits a degree of
identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%, with the
CDS (encoding) part of the amino acid sequence shown in SEQ ID NOs:2, 4 or 14.
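
To show how gap creation and gap extension penalties enter a global alignment score, here is a minimal score-only sketch of Needleman-Wunsch alignment with affine gaps (the Gotoh recurrences). It is not the GCG implementation. The match and mismatch scores are hypothetical, and the convention that a gap of length k costs creation + k x extension is an assumption made for this illustration; only the 5.0 and 0.3 defaults come from the text.

```python
NEG_INF = float("-inf")


def global_alignment_score(a: str, b: str, match: float = 1.0, mismatch: float = -1.0,
                           gap_create: float = 5.0, gap_extend: float = 0.3) -> float:
    """Best global alignment score of a and b with affine gap penalties."""
    n, m = len(a), len(b)
    # M: alignment ends in a residue pair; X: ends with a gap in b; Y: ends with a gap in a.
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0.0
    for i in range(1, n + 1):
        X[i][0] = -(gap_create + i * gap_extend)
    for j in range(1, m + 1):
        Y[0][j] = -(gap_create + j * gap_extend)

    open_cost = gap_create + gap_extend  # cost of the first position of a new gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = pair + max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1])
            X[i][j] = max(M[i - 1][j] - open_cost,
                          X[i - 1][j] - gap_extend,
                          Y[i - 1][j] - open_cost)
            Y[i][j] = max(M[i][j - 1] - open_cost,
                          Y[i][j - 1] - gap_extend,
                          X[i][j - 1] - open_cost)
    return max(M[n][m], X[n][m], Y[n][m])


# Three matches and one single-base gap: 3 * 1.0 - (5.0 + 0.3), i.e. about -2.3.
print(global_alignment_score("ACGT", "ACT"))
```

Raising the creation penalty relative to the extension penalty, as in the settings quoted above, favors a few long gaps over many short ones.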

Alternatively, homology may be determined by hybridization analysis wherein a
nucleic acid sequence is hybridized to the complement of a sequence encoding the
aforementioned proteins under stringent, moderately stringent, or low stringent
conditions. See, e.g., Ausubel et al. (Eds.), CURRENT PROTOCOLS IN MOLECULAR
BIOLOGY, John Wiley & Sons, New York, NY, 1993, and below.

Polypeptides of the invention may be isolated from natural cell sources or may 
be chemically synthesized, but are preferably produced by recombinant procedures 
involving host cells of the invention. 

An "isolated" or "purified" protein or biologically active portion thereof is
substantially free of cellular material or other contaminating proteins from the cell or
tissue source from which the NgR protein is derived, or substantially free from
chemical precursors or other chemicals when chemically synthesized. The language
"substantially free of cellular material" includes preparations of NgR protein in which
the protein is separated from cellular components of the cells from which it is isolated
or recombinantly produced. In one embodiment, the language "substantially free of
cellular material" includes preparations of NgR protein having less than about 30% (by
dry weight) of non-NgR protein (also referred to herein as a "contaminating protein"),
more preferably less than about 20% of non-NgR protein, still more preferably less
than about 10% of non-NgR protein, and most preferably less than about 5% non-NgR
protein. When the NgR protein or biologically active portion thereof is recombinantly
produced, it is also preferably substantially free of culture medium, i.e., culture
medium represents less than about 20%, more preferably less than about 10%, and
most preferably less than about 5% of the volume of the protein preparation.

The language "substantially free of chemical precursors or other chemicals"
includes preparations of NgR protein in which the protein is separated from chemical
precursors or other chemicals that are involved in the synthesis of the protein. In one
embodiment, the language "substantially free of chemical precursors or other
chemicals" includes preparations of NgR protein having less than about 30% (by dry
weight) of chemical precursors or non-NgR chemicals, more preferably less than about
20% chemical precursors or non-NgR chemicals, still more preferably less than about
10% chemical precursors or non-NgR chemicals, and most preferably less than about
5% chemical precursors or non-NgR chemicals.

Biologically active portions of a NgR protein include peptides comprising
amino acid sequences sufficiently homologous to or derived from the amino acid
sequence of the NgR protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 4
or 14, that include fewer amino acids than the full length NgR proteins, and exhibit at
least one activity of a NgR protein. Typically, biologically active portions comprise a
domain or motif with at least one activity of the NgR protein. A biologically active
portion of a NgR protein can be a polypeptide which is, for example, 10, 25, 50, 100
or more amino acids in length.

A biologically active portion of a NgR protein of the present invention may
contain at least one of the features that is conserved between the NgR proteins (e.g., a
conserved cysteine as the N-terminus of the mature protein, four conserved cysteines
in the N-terminus before a leucine-rich region, four conserved cysteines C-terminal
with respect to a leucine repeat region, eight leucine-rich repeats, and a hydrophobic
C-terminus). An alternative biologically active portion of a NgR protein may contain
at least two of the above-identified domains. Another biologically active portion of a
NgR protein may contain at least three of the above-identified domains. Yet another
biologically active portion of a NgR protein of the present invention may contain at
least four of the above-identified domains.

Moreover, other biologically active portions, in which other regions of the
protein are deleted, can be prepared by recombinant techniques and evaluated for one
or more of the functional activities of a native NgR protein.

In an embodiment, the NgR protein has an amino acid sequence shown in SEQ
ID NO:2, 4 or 14. In other embodiments, the NgR protein is substantially homologous
to SEQ ID NO:2, 4 or 14 and retains the functional activity of the protein of SEQ ID
NO:2, 4 or 14, yet differs in amino acid sequence due to natural allelic variation or
mutagenesis, as described in detail below.

Accordingly, in another embodiment, the NgR protein is a protein that
comprises an amino acid sequence at least about 45% homologous to the amino acid
sequence of SEQ ID NO:2 or SEQ ID NO:4 or SEQ ID NO:14 and retains the
functional activity of the NgR proteins of SEQ ID NO:2, 4 or 14.

Use of mammalian host cells is expected to provide for such post-translational
modifications (e.g., glycosylation, truncation, lipidation and phosphorylation) as may
be needed to confer optimal biological activity on recombinant expression products of
the invention. Glycosylated and non-glycosylated forms of NgR polypeptides are
embraced by the invention.

The invention also embraces variant (or analog) NgR polypeptides. In one
example, insertion variants are provided wherein one or more amino acid residues
supplement a NgR amino acid sequence. Insertions may be located at either or both
termini of the protein, or may be positioned within internal regions of the NgR amino
acid sequence. Insertional variants with additional residues at either or both termini
can include, for example, fusion proteins and proteins including amino acid tags or
labels.

Insertion variants include NgR polypeptides wherein one or more amino acid
residues are added to a NgR amino acid sequence or to a biologically active fragment
thereof.

Variant products of the invention also include mature NgR products, i.e., NgR
products wherein leader or signal sequences are removed, with additional amino
terminal residues. The additional amino terminal residues may be derived from another
protein, or may include one or more residues that are not identifiable as being derived
from specific proteins. NgR products with an additional methionine residue at
position -1 (Met^-1-NgR) are contemplated, as are variants with additional methionine
and lysine residues at positions -2 and -1 (Met^-2-Lys^-1-NgR). Variants of NgR with
additional Met, Met-Lys, Lys residues (or one or more basic residues in general) are
particularly useful for enhanced recombinant protein production in bacterial host cells.

Polypeptide Variants

The invention also embraces NgR variants having additional amino acid
residues which result from use of specific expression systems.

As used herein, a NgR "chimeric protein" or "fusion protein" comprises a NgR
polypeptide operatively linked to a non-NgR polypeptide. A "NgR polypeptide" refers
to a polypeptide having an amino acid sequence corresponding to NgR, whereas a
"non-NgR polypeptide" refers to a polypeptide having an amino acid sequence
corresponding to a protein that is not homologous to the NgR protein, e.g., a protein
that is different from the NgR protein and that is derived from the same or a different
organism. Within a NgR fusion protein the NgR polypeptide can correspond to all or a
portion of a NgR protein. In one embodiment, a NgR fusion protein comprises at least
one biologically active portion of a NgR protein. In another embodiment, a NgR
fusion protein comprises at least two biologically active portions of a NgR protein. In
yet another embodiment, a NgR fusion protein comprises at least three biologically
active portions of a NgR protein. Within the fusion protein, the term "operatively
linked" is intended to indicate that the NgR polypeptide and the non-NgR polypeptide
are fused in-frame to each other. The non-NgR polypeptide can be fused to the N-
terminus or C-terminus of the NgR polypeptide.

For example, in one embodiment a NgR fusion protein comprises a NgR
domain operably linked to the extracellular domain of a second protein. Such fusion
proteins can be further utilized in screening assays for compounds which modulate
NgR activity (such assays are described in detail below).

For example, use of commercially available vectors that express a desired
polypeptide as part of a glutathione-S-transferase (GST) fusion product provides the
desired polypeptide having an additional glycine residue at position -1 after cleavage of
the GST component from the desired polypeptide.

In another embodiment, the fusion protein is a NgR protein containing a
heterologous signal sequence at its N-terminus. For example, the native NgR signal
sequence (i.e., amino acids 1-30 of SEQ ID NO:2 and amino acids 1-40 of SEQ ID
NO:4) can be removed and replaced with a signal sequence from another protein. In
certain host cells (e.g., mammalian host cells), expression and/or secretion of NgR can be
increased through use of a heterologous signal sequence.

In yet another embodiment, the fusion protein is a NgR-immunoglobulin fusion
protein in which the NgR sequences comprising one or more domains are fused to
sequences derived from a member of the immunoglobulin protein family. The NgR-
immunoglobulin fusion proteins of the invention can be incorporated into
pharmaceutical compositions and administered to a subject to inhibit an interaction
between NgR ligand and a NgR protein on the surface of a cell, to thereby suppress
NgR-mediated signal transduction in vivo. NgR-immunoglobulin fusion proteins can be
used to affect the bioavailability of a NgR cognate ligand. Inhibition of the NgR
ligand/NgR interaction may be useful therapeutically for both the treatment of
proliferative and differentiative disorders, as well as modulating (e.g., promoting or
inhibiting) cell survival. Moreover, the NgR-immunoglobulin fusion proteins of the
invention can be used as immunogens to produce anti-NgR antibodies in a subject, to
purify NgR ligands, and in screening assays to identify molecules that inhibit the
interaction of NgR with NgR ligand.

A NgR chimeric or fusion protein of the invention can be produced by standard
recombinant DNA techniques. For example, DNA fragments coding for the different
polypeptide sequences are ligated together in-frame in accordance with conventional
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation,
restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive
ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and
enzymatic ligation. In another embodiment, the fusion gene can be synthesized by
conventional techniques including automated DNA synthesizers. Alternatively, PCR
amplification of gene fragments can be carried out using anchor primers that give rise
to complementary overhangs between two consecutive gene fragments that can
subsequently be annealed and reamplified to generate a chimeric gene sequence (see,
for example, Ausubel et al. (Eds.) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY,
John Wiley & Sons, 1992). Moreover, many expression vectors are commercially
available that already encode a fusion moiety (e.g., a GST polypeptide). A NgR-
encoding nucleic acid can be cloned into such an expression vector such that the fusion
moiety is linked in-frame to the NgR protein.

Variants resulting from expression in other vector systems are also
contemplated.

Insertional variants also include fusion proteins wherein the amino terminus
and/or the carboxy terminus of NgR is/are fused to another polypeptide.

In another aspect, the invention provides deletion variants wherein one or more
amino acid residues in a NgR polypeptide are removed. Deletions can be effected at
one or both termini of the NgR polypeptide, or with removal of one or more
non-terminal amino acid residues of NgR. Deletion variants, therefore, include all
fragments of a NgR polypeptide.

The invention also embraces polypeptide fragments of the sequence set forth in
SEQ ID NO:2, 4 or 14 wherein the fragments maintain biological (e.g., ligand binding
and/or intracellular signaling) and/or immunological properties of a NgR polypeptide.
Fragments comprising at least 4, 5, 10, 15, 20, 25, 30, 35, or 40 consecutive amino
acids of SEQ ID NO:2, 4 or 14 are contemplated by the invention. Preferred
polypeptide fragments display antigenic properties unique to, or specific for, human
NgR and its allelic and species homologs. Fragments of the invention having the
desired biological and immunological properties can be prepared by any of the methods
well known and routinely practiced in the art.
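
As a small illustration of what "fragments comprising at least N consecutive amino acids" means operationally, the sketch below enumerates every window of a given length along a sequence. The peptide shown is a hypothetical toy string, not SEQ ID NO:2, 4 or 14, and the function name is an assumption made for illustration.

```python
def consecutive_fragments(sequence: str, length: int) -> list[str]:
    """Return every fragment of exactly `length` consecutive residues."""
    if length < 1:
        raise ValueError("fragment length must be at least 1")
    # A sequence of n residues contains n - length + 1 such windows (none if too short).
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Hypothetical 7-residue peptide; the minimum contemplated fragment length is 4.
print(consecutive_fragments("MKRASAG", 4))  # ['MKRA', 'KRAS', 'RASA', 'ASAG']
```

Any longer fragment necessarily contains at least one of these windows, so screening at the minimum length covers the "at least" language.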

In still another aspect, the invention provides substitution variants of NgR
polypeptides. Substitution variants include those polypeptides wherein one or more
amino acid residues of a NgR polypeptide are removed and replaced with alternative
residues. In one aspect, the substitutions are conservative in nature; however, the
invention embraces substitutions that are also non-conservative. Conservative
substitutions for this purpose may be defined as set out in Tables 2, 3, or 4 below.



Table 1.

(based on a NTLRRCT       Column I         Column II
domain)                   (R1, R2, R3)     (R2+R3 only)

X1                        G,R,M
X2                        A,D,C
X3                        V,T
X4                        N,P,S
X5                        E,A,S
X6                        nothing, K       nothing
X7                        V,M,P
X8                        T,V              V
X9                        Q,P              Q
X10                       [not legible]    Q
X11                       Q,H,N
X12                       G,N              N
X13                       L,F              F
X14                       Q,A,S
X15                       A,S
X16                       V,I
X17                       V,T,E,L
X18                       S,G
X19                       L,I
X20                       A,E,V,P
X21                       A,S,D
X22                       S,T
X23                       Q,E










(based on a NTLRRCT       Column I         Column II
domain)                   (R1, R2, R3)     (R2+R3 only)

X24                       [not legible]
X25                       Q,H              Q
X26                       [not legible]    [not legible]
X27                       R,L
X28                       T,G,R,S
X29                       F,L,T,H
X30                       [not legible]    [not legible]
X31                       [not legible]
X32                       Q,P,A            [not legible]
X33                       V,T,A
X34                       H,T,S
X35                       S,G,R
X36                       P,S,A
X37                       C, nothing       nothing
X38                       R, nothing       nothing
X39                       A,N
X40                       M,L
X41                       V,A,T
X42                       T,I              T
X43                       [not legible]
X44                       [not legible]
X45                       N,V              [not legible]
X46                       [not legible]
X47                       T,S,A
X48                       [not legible]
X49                       A,H,Y,D










(based on a NTLRRCT       Column I         Column II
domain)                   (R1, R2, R3)     (R2+R3 only)

X50                       P,A              P
X51 to X73                [not legible]
X74                       K,R
X75                       G,Q








(based on a NTLRRCT       Column I         Column II
domain)                   (R1, R2, R3)     (R2+R3 only)

X76                       S,Q              [not legible]
X77                       A,S,E
X78                       P,G              [not legible]
X79                       A,G,P
X80                       [not legible]
X81                       I,V,L
X82                       G,K
X83                       H,V,A
X84                       S,A              [not legible]
X85                       D,E
X86                       [not legible]
X87                       [not legible]
X88                       E,L,Q
X89                       Y,H,A
X90                       [not legible]    Q
X91                       D,N
X92                       I,L,T
X93                       [not legible]
X94                       V,A,G
X95                       S,T              S
X96                       [not legible]
X97                       L,I              L
X98                       W,R,S
X99                       [not legible]
X100                      L,V              L
X101                      G,T,[not legible]










(based on a NTLRRCT       Column I         Column II
domain)                   (R1, R2, R3)     (R2+R3 only)

X102 to X124              [not legible]
X125                      G,T
X126                      [not legible]    D
X127                      [not legible]










(based on a NTLRRCT       Column I         Column II
domain)                   (R1, R2, R3)     (R2+R3 only)

X128                      P,D
X129                      V,G,P,R
X130                      [not legible]
X131                      [not legible]    [not legible]
X132                      F,Y              F
X133                      G,A,D
X134                      A,P
X135                      D,A,V
X136                      G,D
X137                      A,E
X138                      S,P
X139                      E,A
X140                      L,F
X141                      [not legible]
X142                      [not legible]
X143                      [not legible]
X144                      V,A
X145                      G,V
X146                      A,D,F
X147                      T,P
X148                      [not legible]
X149                      [not legible]
X150                      E,G,P,Q
X151                      [not legible]
X152                      K,L              R
X153                      G,D










(based on a NTLRRCT       Column I         Column II
domain)                   (R1, R2, R3)     (R2+R3 only)

X154                      [not legible]
X155                      [not legible]
X156                      [not legible]
X157                      L,A,R
X158                      [not legible]    R
X159                      V,A,E
X160                      E,A,N
X161                      F,L              F
X162                      R,Q
X163                      N,A,G





Variant polypeptides include those wherein conservative substitutions have
been introduced by modification of polynucleotides encoding polypeptides of the
invention. Amino acids can be classified according to physical properties and
contribution to secondary and tertiary protein structure. A conservative substitution is
recognized in the art as a substitution of one amino acid for another amino acid that
has similar properties. Exemplary conservative substitutions are set out in Table 2
(from WO 97/09433, page 10, published March 13, 1997 (PCT/GB96/02197, filed
9/6/96)), immediately below.



Table 2
Conservative Substitutions I

SIDE CHAIN CHARACTERISTIC       AMINO ACID

Aliphatic
    Non-polar                   G A P
                                I L V
    Polar - uncharged           C S T M
                                N Q
    Polar - charged             D E
                                K R
Aromatic                        H F W Y
Other                           N Q D E

Alternatively, conservative amino acids can be grouped as described in
Lehninger, [BIOCHEMISTRY, Second Edition; Worth Publishers, Inc. NY, NY
(1975), pp.71-77] as set out in Table 3, immediately below.

Table 3
Conservative Substitutions II

SIDE CHAIN CHARACTERISTIC       AMINO ACID

Non-polar (hydrophobic)
    A. Aliphatic:               A L I V P
    B. Aromatic:                F W
    C. Sulfur-containing:       M
    D. Borderline:              G
Uncharged-polar
    A. Hydroxyl:                S T Y
    B. Amides:                  N Q
    C. Sulfhydryl:              C
    D. Borderline:              G
Positively Charged (Basic):     K R H
Negatively Charged (Acidic):    D E



As still another alternative, exemplary conservative substitutions are set out in
Table 4, below.

Table 4
Conservative Substitutions III

Original Residue    Exemplary Substitution

Ala (A)             Val, Leu, Ile
Arg (R)             Lys, Gln, Asn
Asn (N)             Gln, His, Lys, Arg
Asp (D)             Glu
Cys (C)             Ser
Gln (Q)             Asn
Glu (E)             Asp
His (H)             Asn, Gln, Lys, Arg
Ile (I)             Leu, Val, Met, Ala, Phe,
Leu (L)             Ile, Val, Met, Ala, Phe
Lys (K)             Arg, Gln, Asn
Met (M)             Leu, Phe, Ile
Phe (F)             Leu, Val, Ile, Ala
Pro (P)             Gly
Ser (S)             Thr
Thr (T)             Ser
Trp (W)             Tyr
Tyr (Y)             Trp, Phe, Thr, Ser
Val (V)             Ile, Leu, Met, Phe, Ala
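
One way to apply a grouping such as Table 3 programmatically is to treat a substitution as conservative when both residues fall in the same side-chain class. The sketch below encodes the Table 3 classes as one-letter sets; the dictionary keys and the function name are illustrative assumptions, and the same-class rule is only one reading of the table.

```python
# Side-chain classes transcribed from Table 3 (one-letter amino acid codes).
TABLE3_GROUPS = {
    "non-polar aliphatic": set("ALIVP"),
    "non-polar aromatic": set("FW"),
    "non-polar sulfur-containing": set("M"),
    "non-polar borderline": set("G"),
    "uncharged-polar hydroxyl": set("STY"),
    "uncharged-polar amide": set("NQ"),
    "uncharged-polar sulfhydryl": set("C"),
    "uncharged-polar borderline": set("G"),
    "positively charged (basic)": set("KRH"),
    "negatively charged (acidic)": set("DE"),
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if the two residues differ but share at least one Table 3 class."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(original in group and replacement in group
               for group in TABLE3_GROUPS.values())


print(is_conservative("L", "I"))  # True: both aliphatic
print(is_conservative("D", "K"))  # False: acidic versus basic
```

Tables 2 and 4 define different groupings, so the same pair of residues may be classed differently depending on which table is applied.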



In addition, amino acid residues that are conserved among family members of
the NgR proteins of the present invention, as indicated by the alignment presented
herein, are also predicted to be particularly unamenable to alteration. For example,
NgR proteins of the present invention can contain at least one domain that is a typically
conserved region in NgRs. Examples of these conserved domains include, e.g.,
leucine-rich repeat domain. Amino acid residues that are not conserved or are only
semi-conserved among members of the NgR proteins may be readily amenable to
alteration.

Full-length NgRs have an LRR region characterized by the amino acid
consensus sequence shown in SEQ ID NO:19. At least some full-length NgRs also
include a CT signaling (CTS) domain and a GPI domain.
The NgR domain designations used herein are defined as follows:



Domain    [column headings illegible; one surviving fragment reads "SEQ ID: 5"]

[rows preceding LRR6 largely illegible; surviving entries include 77-100 and 142-165]
LRR6      179-202   179-202   180-203   125-148   190-213
LRR7      203-226   203-226   204-227   149-172   214-237
LRR8      227-250   227-250   228-251   173-196   238-261
LRRCT     260-309   [remaining entries illegible]
[later rows illegible; one surviving entry reads 439-462]

In some embodiments of the invention, the above domains are modified.
Modification can be in a manner that preserves domain functionality. Modification can
include addition, deletion or substitution of certain amino acids. Exemplary
modifications include conservative amino acid substitutions. Preferably, such
substitutions number 20 or fewer per 100 residues. More preferably, such
substitutions number 10 or fewer per 100 residues. Further exemplary modifications
include addition of flanking sequences of up to five amino acids at the N terminus
and/or C terminus of one or more of the domains.
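
By way of illustration only, the "per 100 residues" limits above amount to simple arithmetic on a domain and its modified counterpart. The sketch below assumes that the two sequences have equal length, so that only substitutions (not additions or deletions) are counted; the function names are hypothetical.

```python
def substitutions_per_100(reference: str, variant: str) -> float:
    """Number of substituted positions, normalized to 100 residues."""
    if not reference or len(reference) != len(variant):
        raise ValueError("sequences must be non-empty and of equal length")
    differences = sum(1 for a, b in zip(reference.upper(), variant.upper()) if a != b)
    return 100.0 * differences / len(reference)


def within_limit(reference: str, variant: str, limit_per_100: float = 20.0) -> bool:
    """True if the variant carries no more than `limit_per_100` substitutions
    per 100 residues (20 is the 'preferably' figure; 10 the 'more preferably')."""
    return substitutions_per_100(reference, variant) <= limit_per_100
```

Under these assumptions a 24-residue repeat could carry at most four substitutions at the 20-per-100 limit (4/24 is about 16.7 per 100) and at most two at the 10-per-100 limit.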

In some embodiments, the isolated nucleic acid molecule encodes a polypeptide
at least about 70%, 80%, 90%, 95%, 98%, and most preferably at least about 99%
homologous to SEQ ID NO:2, 4 or 14.
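
By way of illustration only, a percentage of this kind can be computed from a pairwise alignment. The sketch below makes simplifying assumptions that are not taken from the specification: it measures identity rather than any broader notion of homology, it expects the two sequences to be already aligned with "-" as the gap character, and it counts a gap opposite a residue as a mismatch. The names percent_identity, highest_threshold_met and THRESHOLDS are hypothetical.

```python
# The tiers recited in the text, in percent.
THRESHOLDS = (70, 80, 90, 95, 98, 99)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.

    Columns in which both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity: float):
    """Largest recited tier that `identity` reaches, or None if below 70%."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None
```

In practice the alignment itself would come from an alignment program, and the choice of scoring parameters would affect the resulting percentage.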

Mutations can be introduced into SEQ ID NOS: 1, 3 or 13 by standard
techniques, e.g., site-directed mutagenesis and PCR-mediated mutagenesis.
Conservative amino acid substitutions can be made at one or more amino acid residues
predicted to be non-essential. Alternatively, mutations can be introduced randomly
along a NgR coding sequence. This can be accomplished, e.g., by saturation
mutagenesis. The resulting mutants can be screened for NgR biological activity.
Biological activities of NgR may include but are not limited to: (1) protein:protein
interactions, e.g., with other NgRs or other cell-surface proteins involved in
Nogo-related signaling; (2) complex formation with a NgR ligand; (3) binding to an
anti-NgR antibody.

It should be understood that the definition of polypeptides of the invention is
intended to include polypeptides bearing modifications other than insertion, deletion,
or substitution of amino acid residues. By way of example, the modifications may be
covalent in nature, and include, for example, chemical bonding with polymers, lipids,
other organic and inorganic moieties. Such derivatives may be prepared to increase
circulating half-life of a polypeptide, or may be designed to improve the targeting
capacity of the polypeptide for desired cells, tissues or organs. Similarly, the invention
further embraces NgR polypeptides that have been covalently modified to include one
or more water-soluble polymer attachments such as polyethylene glycol,
polyoxyethylene glycol or polypropylene glycol. Variants that display ligand binding
properties of native NgR and are expressed at higher levels, as well as variants that
provide for constitutively active receptors, are particularly useful in assays of the
invention; the variants are also useful in providing cellular, tissue and animal models of
diseases/conditions characterized by aberrant NgR activity.

Chemically modified NgR polypeptide compositions in which the NgR
polypeptide is linked to a polymer are included within the scope of the present
invention. The polymer may be water soluble to prevent precipitation of the protein in
an aqueous environment, such as a physiological environment. Suitable water-soluble
polymers may be selected from the group consisting of, for example, polyethylene
glycol (PEG), monomethoxypolyethylene glycol, dextran, cellulose, or other
carbohydrate based polymers, poly-(N-vinyl pyrrolidone) polyethylene glycol,
polypropylene glycol homopolymers, a polypropylene oxide/ethylene oxide copolymer,
polyoxyethylated polyols (e.g., glycerol) and polyvinyl alcohol. The selected polymer is
usually modified to have a single reactive group, such as an active ester for acylation or
an aldehyde for alkylation, so that the degree of polymerization may be controlled.
Polymers may be of any molecular weight, and may be branched or unbranched, and
mixtures of such polymers may also be used. When the chemically modified NgR
polymer is destined for therapeutic use, pharmaceutically acceptable polymers will be
selected for use.

When the polymer is to be modified by an acylation reaction, the polymer
should have a single reactive ester group. Alternatively, if the polymer is to be
modified by reductive alkylation, the polymer should have a single reactive aldehyde
group. A preferred reactive aldehyde is polyethylene glycol propionaldehyde, which is
water stable, or mono C1-C10 alkoxy or aryloxy derivatives thereof (see U.S. Patent
No. 5,252,714, incorporated by reference herein in its entirety).

Pegylation of NgR polypeptides may be carried out by any of the pegylation
reactions known in the art, as described, for example, in the following references:
Focus on Growth Factors 3, 4-10 (1992); EP 0 154 316; and EP 0 401 384 (each of
which is incorporated by reference herein in its entirety). Preferably, the pegylation is
carried out via an acylation reaction or an alkylation reaction with a reactive
polyethylene glycol molecule (or an analogous reactive water-soluble polymer). A
preferred water-soluble polymer for pegylation of polypeptides such as NgR is
polyethylene glycol (PEG). As used herein, "polyethylene glycol" is meant to
encompass any of the forms of PEG that have been used to derivatize other proteins,
such as mono (C1-C10) alkoxy- or aryloxy-polyethylene glycol.

Chemical derivatization of NgR polypeptides may be performed under any
suitable conditions used to react a biologically active substance with an activated
polymer molecule. Methods for preparing pegylated NgR polypeptides will generally
comprise the steps of (a) reacting the polypeptide with polyethylene glycol, such as a
reactive ester or aldehyde derivative of PEG, under conditions whereby NgR
polypeptide becomes attached to one or more PEG groups, and (b) obtaining the
reaction products. It will be apparent to one of ordinary skill in the art to select the
optimal reaction conditions for the acylation reactions based on known parameters and
the desired result.

Pegylated and other polymer:NgR polypeptides may generally be used to treat
conditions that may be alleviated or modulated by administration of the NgR
polypeptides described herein. However, the chemically-derivatized polymer:NgR
polypeptide molecules disclosed herein may have additional activities, enhanced or
reduced biological activity, or other characteristics, such as increased or decreased
half-life, as compared to the nonderivatized molecules. The NgR polypeptides,
fragments thereof, variants and derivatives, may be employed alone, together, or in
combination with other pharmaceutical compositions, such as cytokines, growth factors,
antibiotics, anti-inflammatories and/or chemotherapeutic agents, as is appropriate for the
indication being treated.

The present invention provides compositions comprising purified polypeptides
of the invention. Preferred compositions comprise, in addition to the polypeptide of
the invention, a pharmaceutically acceptable (i.e., sterile and non-toxic) liquid,
semisolid, or solid diluent that serves as a pharmaceutical vehicle, excipient or medium.
Any diluent known in the art may be used. Exemplary diluents include, but are not
limited to, water, saline solutions, polyoxyethylene sorbitan monolaurate, magnesium
stearate, methyl- and propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose,
dextrose, sorbitol, mannitol, glycerol, calcium phosphate, mineral oil and cocoa butter.

Variants that display ligand binding properties of native NgR and are expressed
at higher levels, as well as variants that provide for constitutively active receptors, are
particularly useful in assays of the invention; the variants are also useful in
providing cellular, tissue and animal models of diseases/conditions
characterized by aberrant NgR activity.

With the knowledge of the nucleotide sequence information disclosed in the
present invention, one skilled in the art can identify and obtain nucleotide sequences
which encode NgR from different sources (i.e., different tissues or different organisms)
through a variety of means well known to the skilled artisan and as disclosed by, for
example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Second
Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), which is
incorporated herein by reference in its entirety.

For example, DNA that encodes NgR may be obtained by screening of mRNA,
cDNA, or genomic DNA with oligonucleotide probes generated from the NgR gene
sequence information provided herein. Probes may be labeled with a detectable group,
such as a fluorescent group, a radioactive atom or a chemiluminescent group in
accordance with procedures known to the skilled artisan and used in conventional
hybridization assays, as described by, for example, Sambrook et al. (1989), above.

A nucleic acid molecule comprising any of the NgR nucleotide sequences
described above can alternatively be synthesized by use of the polymerase chain
reaction (PCR) procedure, with the PCR oligonucleotide primers produced from the
nucleotide sequences provided herein. See U.S. Patent Nos. 4,683,195 to Mullis et al.
and 4,683,202 to Mullis. The PCR reaction provides a method for selectively
increasing the concentration of a particular nucleic acid sequence even when that
sequence has not been previously purified and is present only in a single copy in a
particular sample. The method can be used to amplify either single- or
double-stranded DNA. The essence of the method involves the use of two
oligonucleotide probes to serve as primers for the template-dependent,
polymerase-mediated replication of a desired nucleic acid molecule.
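
By way of illustration only, the role of the two primers can be modeled in a few lines: the forward primer matches the top strand directly, while the reverse primer anneals to the top strand and therefore appears in it as its reverse complement. The sketch below is a deliberately simplified in-silico model and not a PCR protocol; it assumes exact primer matches, a linear template given as its top strand 5' to 3', and hypothetical function names.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def predict_amplicon(template: str, forward: str, reverse: str):
    """Return the top-strand region bounded by the two primers, or None.

    `forward` is identical to the 5' end of the target on the top strand.
    `reverse` is complementary to the 3' end of the target, so its reverse
    complement is searched for downstream of the forward primer site.
    """
    template = template.upper()
    forward = forward.upper()
    start = template.find(forward)
    if start == -1:
        return None
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(site)]
```

A real primer design would additionally consider melting temperature, mismatches and secondary structure, none of which is modeled here.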

A wide variety of alternative cloning and in vitro amplification methodologies
are well known to those skilled in the art. Examples of these techniques are found in,
for example, Berger et al., Guide to Molecular Cloning Techniques, Methods in
Enzymology 152, Academic Press, San Diego, CA, which is incorporated herein by
reference in its entirety.

The nucleic acid molecules of the present invention, and fragments derived
therefrom, are useful for screening for restriction fragment length polymorphism
(RFLP) associated with certain disorders, as well as for genetic mapping.
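
By way of illustration only, an RFLP arises when a sequence change creates or destroys a restriction site, so that digestion of two alleles yields fragments of different lengths. The sketch below assumes a linear molecule and uses the EcoRI recognition sequence GAATTC, cut after the first G, purely as an example enzyme; the function name digest and the toy alleles are hypothetical.

```python
def digest(sequence: str, site: str = "GAATTC", cut_offset: int = 1):
    """Fragment lengths from cutting a linear sequence at every `site`.

    `cut_offset` is the position of the cut within the recognition site
    (1 for EcoRI, which cuts G^AATTC on the strand shown).
    """
    sequence = sequence.upper()
    cuts = []
    position = sequence.find(site)
    while position != -1:
        cuts.append(position + cut_offset)
        position = sequence.find(site, position + 1)
    bounds = [0] + cuts + [len(sequence)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


# Two toy alleles that differ by one base inside the second site.
allele_a = "AAAA" + "GAATTC" + "CCCCCC" + "GAATTC" + "TT"
allele_b = "AAAA" + "GAATTC" + "CCCCCC" + "GAGTTC" + "TT"
```

Here digest(allele_a) gives three fragments and digest(allele_b) gives two, which is the kind of length difference that would be resolved on a gel.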



Antibodies

Also comprehended by the present invention are antibodies (e.g., monoclonal
and polyclonal antibodies, single chain antibodies, chimeric antibodies,
bifunctional/bispecific antibodies, humanized antibodies, human antibodies, and
complementary determining region (CDR)-grafted antibodies, including compounds
which include CDR sequences which specifically recognize a polypeptide of the
invention) specific for NgR or fragments thereof. Preferred antibodies of the invention
are human antibodies which are produced and identified according to methods
described in WO93/11236, published June 20, 1993, which is incorporated herein by
reference in its entirety. Antibody fragments, including Fab, Fab', F(ab')2, and Fv, are
also provided by the invention. The term "specific for," when used to describe
antibodies of the invention, indicates that the variable regions of the antibodies of the
invention recognize and bind NgR polypeptides exclusively (i.e., are able to distinguish
NgR polypeptides from other known NgR polypeptides by virtue of measurable
differences in binding affinity, despite the possible existence of localized sequence
identity, homology, or similarity between NgR and such polypeptides).

The antigenic peptide of NgR comprises at least 8 amino acid residues of the
amino acid sequence shown in SEQ ID NO:2, 4 or 14 and encompasses an epitope of
NgR such that an antibody raised against the peptide forms a specific immune complex
with NgR. Preferably, the antigenic peptide comprises at least 10 amino acid residues,
more preferably at least 15 amino acid residues, even more preferably at least 20 amino
acid residues, and most preferably at least 30 amino acid residues. Preferred epitopes
encompassed by the antigenic peptide are regions of NgR that are located on the
surface of the protein, e.g., hydrophilic regions.
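
By way of illustration only, candidate hydrophilic stretches of at least 8 residues can be located with a sliding-window hydropathy scan. The sketch below uses the Kyte-Doolittle hydropathy scale, which is a technique chosen here for the example and is not named in the specification; the window length of 8 follows the minimum peptide length above, while the cut-off of -1.0 and the function name hydrophilic_windows are arbitrary assumptions.

```python
# Kyte-Doolittle hydropathy values (negative = hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydrophilic_windows(sequence: str, window: int = 8, cutoff: float = -1.0):
    """Return (start, peptide, mean_hydropathy) for each hydrophilic window.

    `start` is 1-based. A window qualifies when its mean Kyte-Doolittle
    value is at or below `cutoff` (an illustrative threshold).
    """
    sequence = sequence.upper()
    hits = []
    for i in range(len(sequence) - window + 1):
        peptide = sequence[i:i + window]
        mean = sum(KYTE_DOOLITTLE[residue] for residue in peptide) / window
        if mean <= cutoff:
            hits.append((i + 1, peptide, mean))
    return hits
```

A hydropathy scan of this kind only suggests surface exposure; whether a given peptide is actually immunogenic would still have to be determined experimentally.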

It will be understood that specific antibodies may also interact with other
proteins (for example, S. aureus protein A or other antibodies in ELISA techniques)
through interactions with sequences outside the variable region of the antibodies, and,
in particular, in the constant region of the molecule. Screening assays to determine
binding specificity of an antibody of the invention are well known and routinely
practiced in the art. For a comprehensive discussion of such assays, see Harlow et al.
in Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY (1988), Chapter 6. Antibodies that recognize and bind fragments
of the NgR polypeptides of the invention are also contemplated, provided that the
antibodies are specific for NgR polypeptides. Antibodies of the invention can be
produced using any method well known and routinely practiced in the art.

For the production of polyclonal antibodies, various suitable host animals (e.g.,
rabbit, goat, mouse or other mammal) may be immunized by injection with the native
protein, or a synthetic variant thereof, or a derivative of the foregoing. An appropriate
immunogenic preparation can contain, for example, recombinantly expressed NgR
protein or a chemically synthesized NgR polypeptide. The preparation can further
include an adjuvant. Various adjuvants used to increase the immunological response
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g.,
aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols,
polyanions, peptides, oil emulsions, dinitrophenol, etc.), human adjuvants such as
Bacille Calmette-Guerin and Corynebacterium parvum or similar immunostimulatory
agents. If desired, the antibody molecules directed against NgR can be isolated from
the mammal (e.g., from the blood) and further purified by well known techniques, such
as protein A chromatography to obtain the IgG fraction.

The term "monoclonal antibody" or "monoclonal antibody composition," as
used herein, refers to a population of antibody molecules that contain only one species
of an antigen binding site capable of immunoreacting with a particular epitope of NgR.
A monoclonal antibody composition thus typically displays a single binding affinity for
a particular NgR protein with which it immunoreacts. For preparation of monoclonal
antibodies directed towards a particular NgR protein, or derivatives, fragments,
analogs or homologs thereof, any technique that provides for the production of
antibody molecules by continuous cell line culture may be utilized. Such techniques
include, but are not limited to, the hybridoma technique (see Kohler and Milstein
(1975) Nature 256, 495-497); the trioma technique; the human B-cell hybridoma
technique (see Kozbor et al., (1983) Immunol. Today 4, 72) and the EBV hybridoma
technique to produce human monoclonal antibodies (see Cole et al., (1985) in
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).
Human monoclonal antibodies may be utilized in the practice of the present invention
and may be produced by using human hybridomas (see Cote et al., (1983) Proc. Natl.
Acad. Sci. USA 80, 2026-2030) or by transforming human B-cells with Epstein Barr
Virus in vitro (see Cole et al., (1985), above).

According to the invention, techniques can be adapted for the production of
single-chain antibodies specific to a NgR protein (see e.g., U.S. Patent No. 4,946,778).
In addition, methods can be adapted for the construction of Fab expression libraries
(see e.g., Huse et al., (1989) Science 246, 1275-1281) to allow rapid and effective
identification of monoclonal Fab fragments with the desired specificity for a NgR
protein or derivatives, fragments, analogs or homologs thereof. Non-human antibodies
can be "humanized" by techniques well known in the art. See e.g., U.S. Patent No.
5,225,539. In one method, the non-human CDRs are inserted into a human antibody
or consensus antibody framework sequence. Further changes can then be introduced
into the antibody framework to modulate affinity or immunogenicity. Antibody
fragments that contain the idiotypes to a NgR protein may be produced by techniques
known in the art including, but not limited to: (i) an F(ab')2 fragment produced by
pepsin digestion of an antibody molecule; (ii) an Fab fragment generated by reducing
the disulfide bridges of an F(ab')2 fragment; (iii) an Fab fragment generated by the
treatment of the antibody molecule with papain and a reducing agent; and (iv) Fv
fragments.

Additionally, recombinant anti-NgR antibodies, such as chimeric and
humanized monoclonal antibodies, comprising both human and non-human portions,
which can be made using standard recombinant DNA techniques, are within the scope
of the invention. Such chimeric and humanized monoclonal antibodies can be
produced by recombinant DNA techniques known in the art, for example using
methods described in PCT International Application No. PCT/US86/02269; European
Patent Application No. 184,187; European Patent Application No. 171,496; European
Patent Application No. 173,494; PCT International Publication No. WO 86/01533;
U.S. Pat. No. 4,816,567; European Patent Application No. 125,023; Better et al.,
(1988) Science 240, 1041-1043; Liu et al., (1987) Proc. Natl. Acad. Sci. USA 84,
3439-3443; Liu et al., (1987) J. Immunol. 139, 3521-3526; Sun et al., (1987) Proc.
Natl. Acad. Sci. USA 84, 214-218; Nishimura et al., (1987) Cancer Res. 47, 999-1005;
Wood et al., (1985) Nature 314, 446-449; Shaw et al., (1988) J. Natl. Cancer Inst.
80, 1553-1559; Morrison (1985) Science 229, 1202-1207; Oi et al., (1986)
BioTechniques 4, 214; U.S. Patent No. 5,225,539; Jones et al., (1986) Nature 321,
522-525; Verhoeyen et al., (1988) Science 239, 1534; and Beidler et al., (1988) J.
Immunol. 141, 4053-4060.

In a preferred embodiment of the invention a portion of a NgR is joined to an
Fc portion of an antibody to form a NgR/Fc fusion protein. Preferably, the Ig fusion
protein is soluble. The NgR/Fc fusion protein may be formed by recombinant
techniques as described above. In one embodiment, a portion of a NgR including the
entire amino acid sequence of NgR except the C-terminal hydrophobic region is fused
to an Fc portion of an antibody. In preferred embodiments, the NgR is a human NgR
and the Fc is also human. More preferably, the human Fc portion is derived from an
IgG antibody. In other embodiments, the N-terminal signal sequence is omitted. Such
antibodies are useful in binding Nogo to prevent Nogo signaling through the NgR.

In one embodiment, methods for the screening of antibodies that possess the
desired specificity include, but are not limited to, enzyme-linked immunosorbent assay
(ELISA) and other immunologically-mediated techniques known within the art. In a
specific embodiment, selection of antibodies that are specific to a particular domain of
a NgR protein is facilitated by generation of hybridomas that bind to the fragment of a
NgR protein possessing such a domain. Antibodies that are specific for one or more
domains within a NgR protein, e.g., domains spanning the above-identified conserved
regions of NgRs, or derivatives, fragments, analogs or homologs thereof, are also
provided herein.

Anti-NgR antibodies may be used in methods known within the art relating to
the localization and/or quantitation of a NgR protein (e.g., for use in measuring levels
of the NgR protein within appropriate physiological samples, for use in diagnostic
methods, for use in imaging the protein, and the like). In a given embodiment,
antibodies for NgR proteins, or derivatives, fragments, analogs or homologs thereof,
that contain the antibody derived binding domain, are utilized as
pharmacologically-active compounds [hereinafter "Therapeutics"].

An anti-NgR antibody (e.g., monoclonal antibody) can be used to isolate NgR
by standard techniques, such as affinity chromatography or immunoprecipitation. An
anti-NgR antibody can facilitate the purification of natural NgR from cells and of
recombinantly produced NgR expressed in host cells. Moreover, an anti-NgR antibody
can be used to detect NgR protein (e.g., in a cellular lysate or cell supernatant) in order
to evaluate the abundance and pattern of expression of the NgR protein. Anti-NgR
antibodies can be used diagnostically to monitor protein levels in tissue as part of a
clinical testing procedure, e.g., to determine the efficacy of a given
treatment regimen. Detection can be facilitated by coupling (i.e., physically linking)
the antibody to a detectable substance. Examples of detectable substances include
various enzymes, prosthetic groups, fluorescent materials, luminescent materials,
bioluminescent materials and radioactive materials. Examples of suitable enzymes
include horseradish peroxidase, alkaline phosphatase, β-galactosidase, or
acetylcholinesterase; examples of suitable prosthetic group complexes include
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine,
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a
luminescent material includes luminol; examples of bioluminescent materials include
luciferase, luciferin and aequorin; and examples of suitable radioactive material include
125I, 131I, 35S or 3H.

Another aspect of the present invention is directed to methods of inducing an
immune response in a mammal against a polypeptide of the invention by administering
to the mammal an amount of the polypeptide sufficient to induce an immune response.
The amount will be dependent on the animal species, size of the animal, and the like
but can be determined by those skilled in the art.

Another aspect of the invention is directed to anti-idiotypic antibodies and
anti-anti-idiotypic antibodies. An anti-idiotypic antibody is an antibody that recognizes
determinants of another antibody (a target antibody). Generally, the anti-idiotypic
antibody recognizes determinants of the antigen-binding site of the target antibody.
Typically, the target antibody is a monoclonal antibody. An anti-idiotypic antibody is
generally prepared by immunizing an animal (particularly, mice) of the same species
and genetic type as the source of the target monoclonal antibody, with the target
monoclonal antibody. The immunized animal mounts an immune response to the
idiotypic determinants of the target monoclonal antibody and produces antibodies
against the idiotypic determinants of the target monoclonal antibody.
Antibody-producing cells, such as splenic cells, of the immunized animal may be used
to generate anti-idiotypic monoclonal antibodies. Furthermore, an anti-idiotypic
antibody may also be used to immunize animals to produce anti-anti-idiotypic
antibodies. These immunized animals may be used to generate anti-anti-idiotypic
monoclonal antibodies using standard techniques. The anti-anti-idiotypic antibodies
may bind to the same epitope as the original, target monoclonal antibody used to
prepare the anti-idiotypic antibody. The anti-anti-idiotypic antibodies represent other
monoclonal antibodies with the same antigen specificity as the original target
monoclonal antibody.

If the binding of the anti-idiotypic antibody with the target antibody is inhibited
by the relevant antigen of the target antibody, and if the anti-idiotypic antibody induces
an antibody response with the same specificity as the target antibody, it mimics the
antigen of the target antibody. Such an anti-idiotypic antibody is an "internal image
anti-idiotype" and is capable of inducing an antibody response as if it were the original
antigen (Bona and Kohler (1984) Anti-idiotypic Antibodies and Internal Image,
in Monoclonal and Anti-idiotypic Antibodies: Probes for Receptor Structure
and Function, Venter J.C. et al. (Eds), Alan R. Liss, New York, NY, pp. 141-149,
1984). Vaccines incorporating internal image anti-idiotype antibodies have been
shown to induce protective responses against viruses, bacteria, and parasites (Kennedy
et al., (1986) 232, 220-223; 1047; McNamara et al., (1985) Science 226, 1325-1326).
Internal image anti-idiotypic antibodies have also been shown to induce immunity to
tumor related antigens (Raychauhuri et al., (1986) J. Immunol. 137, 1743-1749;
Raychauhuri et al., J. Immunol. 139, 3902-3910; Bhattacharya-Chatterjee et
al., (1987) J. Immunol. 139, 1354-1360; Bhattacharya-Chatterjee et al., (1988) J.
Immunol. 141, 1398-1403; Herlyn et al. (1989) Intern. Rev. Immunol. 4, 347-357;
Chen et al. (1990) Cell Imm. Immunother. Cancer 351-359; Herlyn et al., (1991) In
Vivo 5, 615-624; Furuya et al. (1992) Anticancer Res. 12, 27-32; Mittelman, A. et al.
(1992) Proc. Natl. Acad. Sci. USA 89, 466-470; Durrant et al., (1994) Cancer Res.
54, 4837-4840; Mittelman et al. (1994) Cancer Res. 54, 415-421; Schmitt et al.
(1994) Hybridoma 13, 389-396; Chakrobarty et al. (1995) J. Immunother. 18,
95-103; Chakrobarty et al. (1995) Cancer Res. 55, 1525-1530; Foon, K. A. et al.
(1995) Clin. Cancer Res. 1, 1205-1294; Herlyn et al. (1995) Hybridoma 14, 159-166;
Sclebusch et al. (1995) Hybridoma 14, 167-174; Herlyn et al. (1996) Cancer
Immunol. Immunother. 43, 65-76).

Anti-idiotypic antibodies for NgR may be prepared, for example, by
immunizing an animal, such as a mouse, with an immunogenic amount of a composition
comprising NgR2 (SEQ ID NO:2), NgR3 (SEQ ID NOs:4 or 14), or immunogenic
portion thereof, containing at least one antigenic epitope of NgR. The composition
may also contain a suitable adjuvant, and any carrier necessary to provide
immunogenicity. Monoclonal antibodies recognizing NgR may be prepared from the
cells of the immunized animal as described above. A monoclonal antibody recognizing
an epitope of NgR is then selected and used to prepare a composition comprising an
immunogenic amount of the anti-NgR monoclonal antibody. Typically, a 25 to 200 µg
dose of purified anti-NgR monoclonal would be sufficient in a suitable adjuvant.

Animals may be immunized 2-6 times at 14 to 30 day intervals between doses.
Typically, animals are immunized by any suitable route of administration, such as
intraperitoneal, subcutaneous, intravenous or a combination of these. Anti-idiotypic
antibody production may be monitored during the immunization period using standard
immunoassay methods. Animals with suitable titers of antibodies reactive with the
target monoclonal antibodies may be reimmunized with the monoclonal antibody used
as the immunogen three days before harvesting the antibody producing cells.
Preferably, spleen cells are used, although other antibody producing cells may be
selected. Antibody-producing cells are harvested and fused with myeloma cells to
produce hybridomas, as described above, and suitable anti-idiotypic
antibody-producing cells are selected.

Anti-anti-idiotypic antibodies are produced by another round of immunization
and hybridoma production by using the anti-idiotypic monoclonal antibody as the
immunogen.

Antibodies of the invention are useful for, e.g., therapeutic purposes (by
modulating activity of NgR), diagnostic purposes to detect or quantitate NgR, and
purification of NgR. Therefore, kits comprising an antibody of the invention for any of
the purposes described herein are also comprehended.



Kits 

The present invention is also directed to kits, including pharmaceutical kits.
The kits can comprise any of the nucleic acid molecules described above, any of the
polypeptides described above, or any antibody which binds to a polypeptide of the
invention as described above, as well as appropriate controls, such as positive and/or
negative controls. The kit preferably comprises additional components, such as, for
example, instructions, solid support, reagents helpful for quantification, and the like.
For example, the kit can comprise: a labeled compound or agent capable of detecting
NgR protein or mRNA in a biological sample; means for determining the amount of
NgR in the sample; and means for comparing the amount of NgR in the sample with a
standard. The compound or agent can be packaged in a suitable container.



Screening Assays

The DNA and amino acid sequence information provided by the present
invention also makes possible identification of binding partner compounds with which
a NgR polypeptide or polynucleotide will interact. Methods to identify binding partner
compounds include solution assays, in vitro assays wherein NgR polypeptides are
immobilized and cell-based assays. Identification of binding partner compounds of
NgR polypeptides provides candidates for therapeutic or prophylactic intervention in
pathologies associated with NgR normal and aberrant biological activity.

The invention also provides a method (also referred to herein as a "screening
assay") for identifying modulators, candidate or test compounds or agents (e.g.,
peptides, peptidomimetics, small molecules (e.g., molecules of less than 1,000 Daltons)
or other drugs) that bind to NgR proteins or have a stimulatory or inhibitory effect on,
for example, NgR expression or NgR activity.

In one embodiment, the invention provides assays for screening candidate or
test compounds which bind to or modulate the activity of a NgR protein or
polypeptide or biologically active portion thereof. The test compounds of the present
invention can be obtained using any of the numerous approaches in combinatorial
library methods known in the art, including: biological libraries; spatially addressable
parallel solid phase or solution phase libraries; synthetic library methods requiring
deconvolution; the "one-bead one-compound" library method; and synthetic library
methods using affinity chromatography selection. The biological library approach is
limited to peptide libraries, while the other four approaches are applicable to peptide,
non-peptide oligomer or small molecule libraries of compounds (Lam (1997)
Anticancer Drug Des. 12, 145).

Examples of methods for the synthesis of molecular libraries can be found in
the art, for example in: DeWitt et al., (1993) Proc. Natl. Acad. Sci. USA 90, 6909; Erb
et al., (1994) Proc. Natl. Acad. Sci. USA 91, 11422; Zuckermann et al. (1994) J. Med.
Chem. 37, 2678; Cho et al., (1993) Science 261, 1303; Carrell et al., (1994) Angew.
Chem. Int. Ed. Engl. 33, 2059; Carell et al., (1994) Angew. Chem. Int. Ed. Engl. 33,
2061; and Gallop et al., (1994) J. Med. Chem. 37, 1233.

Libraries of compounds may be presented in solution (e.g., Houghten (1992)
BioTechniques 13, 412-421), or on beads (Lam (1991) Nature 354, 82-84), on chips
(Fodor (1993) Nature 364, 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409),
spores (Ladner, above), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89,
1865-1869) or on phage (Scott and Smith (1990) Science 249, 386-390; Devlin (1990)
Science 249, 404-406; Cwirla et al. (1990) Proc. Natl. Acad. Sci. USA 87, 6378-6382;
Felici (1991) J. Mol. Biol. 222, 301-310; Ladner, above).

1. Cell-based Assays

The invention also provides cell-based assays to identify binding partner 
compounds of a NgR polypeptide. In one embodiment, the invention provides a 
method comprising the steps of contacting a NgR polypeptide expressed on the surface
of a cell with a candidate binding partner compound and detecting binding of the
candidate binding partner compound to the NgR polypeptide. In another embodiment,
an assay is a cell-based assay comprising contacting a cell expressing a membrane-
bound form of NgR protein, or a biologically active portion thereof, on the cell surface
with a test compound and determining the ability of the test compound to modulate
(e.g., stimulate or inhibit) the activity of the NgR protein or biologically active portion
thereof.

In one embodiment, an assay is a cell-based assay in which a cell which
expresses a membrane-bound form of NgR protein, or a biologically active portion
thereof, on the cell surface is contacted with a test compound and the ability of the test
compound to bind to a NgR protein is determined. The cell, for example, can be of
mammalian origin or a yeast cell. Determining the ability of the test compound to bind
to the NgR protein can be accomplished, for example, by coupling the test compound
with a radioisotope or enzymatic label such that binding of the test compound to the
NgR protein or biologically active portion thereof can be determined by detecting the
labeled compound in a complex. For example, test compounds can be labeled with
125I, 14C, or 3H, either directly or indirectly, and the radioisotope detected by
direct counting of radioemission or by scintillation counting. Alternatively, test
compounds can be enzymatically labeled with, for example, horseradish peroxidase,
alkaline phosphatase or luciferase, and the enzymatic label detected by determination
of conversion of an appropriate substrate to product. In one embodiment, the assay
comprises contacting a cell which expresses a membrane-bound form of NgR protein
or a biologically active portion thereof on the cell surface with a known compound
which binds NgR to form an assay mixture, contacting the assay mixture with a test
compound, and determining the ability of the test compound to interact with a NgR
protein, wherein determining the ability of the test compound to interact with a NgR
protein comprises determining the ability of the test compound to preferentially bind to
NgR or a biologically active portion thereof as compared to the known compound.

Determining the ability of the test compound to modulate the activity of NgR
or a biologically active portion thereof can be accomplished, for example, by
determining the ability of the NgR protein to bind to or interact with a NgR target
molecule. As used herein, a "target molecule" is a molecule with which a NgR protein
binds or interacts in nature, for example, a molecule on the surface of a cell which
expresses a NgR protein, a molecule on the surface of a second cell, a molecule in the
extracellular milieu, a molecule associated with the internal surface of a cell membrane
or a cytoplasmic molecule. A NgR target molecule can be a non-NgR molecule or a
NgR protein or polypeptide of the present invention. In one embodiment, a NgR
target molecule is a component of a signal transduction pathway that facilitates
transduction of an extracellular signal (e.g., a signal generated by binding of a
compound to a membrane-bound NgR molecule) through the cell membrane and into
the cell. The target, for example, can be a second intercellular protein that has
catalytic activity or a protein that facilitates the association of downstream signaling
molecules with NgR. In a preferred embodiment, the detection comprises detecting a
calcium flux or other physiological event in the cell caused by the binding of the
molecule.

Specific binding molecules, including natural ligands and synthetic compounds,
can be identified or developed using isolated or recombinant NgR products, NgR
variants, or preferably, cells expressing such products. Binding partners are useful for
purifying NgR products and detection or quantification of NgR products in fluid and
tissue samples using known immunological procedures. Binding molecules are also
manifestly useful in modulating (i.e., blocking, inhibiting or stimulating) biological
activities of NgR, especially those activities involved in signal transduction.

2. Cell-free Assays 

(a) Direct binding: 

The invention includes several assay systems for identifying NgR binding
partners. In solution assays, methods of the invention comprise the steps of (a)
contacting a NgR polypeptide with one or more candidate binding partner compounds
and (b) identifying the compounds that bind to the NgR polypeptide. Identification of
the compounds that bind the NgR polypeptide can be achieved by isolating the NgR
polypeptide/binding partner complex and separating the binding partner compound
from the NgR polypeptide. An additional step of characterizing the physical, biological
and/or biochemical properties of the binding partner compound is also comprehended
in another embodiment of the invention. In one aspect, the NgR polypeptide/binding
partner complex is isolated using an antibody immunospecific for either the NgR
polypeptide or the candidate binding partner compound.

In still other embodiments, either the NgR polypeptide or the candidate binding
partner compound comprises a label or tag that facilitates its isolation, and methods of
the invention to identify binding partner compounds include a step of isolating the NgR
polypeptide/binding partner complex through interaction with the label or tag. An
exemplary tag of this type is a poly-histidine sequence, generally around six histidine
residues, that permits isolation of a compound so labeled using nickel chelation. Other
labels and tags, such as the FLAG® tag (Eastman Kodak, Rochester, NY), well known
and routinely used in the art, are embraced by the invention.

(b) Immobilized NgR 

In one variation of an in vitro assay, the invention provides a method
comprising the steps of (a) contacting an immobilized NgR polypeptide, or a
biologically active fragment thereof, with a candidate binding partner compound and (b)
detecting binding of the candidate compound to the NgR polypeptide. In an
alternative embodiment, the candidate binding partner compound is immobilized and
binding of NgR is detected. Immobilization is accomplished using any of the methods
well known in the art, including covalent bonding to a support, a bead or a
chromatographic resin, as well as non-covalent, high affinity interactions such as
antibody binding, or use of streptavidin/biotin binding wherein the immobilized
compound includes a biotin moiety. Binding of a test compound to NgR, or
interaction of NgR with a target molecule in the presence and absence of a candidate
compound, can be accomplished in any vessel suitable for containing the reactants.
Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge
tubes. In one embodiment, a fusion protein can be provided that adds a domain that
allows one or both of the proteins to be bound to a matrix. For example, and not by
way of limitation, GST-NgR fusion proteins or GST-target fusion proteins can be
adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or
glutathione derivatized microtiter plates, that are then combined with the test
compound or the test compound and either the non-adsorbed target protein or NgR
protein, and the mixture is incubated under conditions conducive to complex formation
(e.g., at physiological conditions for salt and pH). Following incubation, the beads or
microtiter plate wells are washed to remove any unbound components, the matrix
immobilized in the case of beads, and the complexes determined either directly or
indirectly, for example, as described above. Alternatively, the complexes can be
dissociated from the matrix, and the level of NgR binding or activity determined using
standard techniques.

Other techniques for immobilizing proteins on matrices can also be used in the
screening assays of the invention. For example, either NgR or its target molecule can
be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated NgR or
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using
techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford,
IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce
Chemical). Alternatively, antibodies reactive with NgR or target molecules, but which
do not interfere with binding of the NgR protein to its target molecule, can be
derivatized to the wells of the plate, and unbound target or NgR trapped in the wells
by antibody conjugation. Methods for detecting such complexes, in addition to those
described above for the GST-immobilized complexes, include immunodetection of
complexes using antibodies reactive with the NgR or target molecule, as well as
enzyme-linked assays that rely on detecting an enzymatic activity associated with the
NgR or target molecule.

Detection of binding can be accomplished (i) using a radioactive label on the
compound that is not immobilized, (ii) using a fluorescent label on the
non-immobilized compound, (iii) using an antibody immunospecific for the
non-immobilized compound, (iv) using a label on the non-immobilized compound that
excites a fluorescent support to which the immobilized compound is attached, (v)
determining the activity of the NgR, as well as other techniques well known and
routinely practiced in the art.

Determining the activity of the target molecule, for example, may be
accomplished by detecting induction of a cellular second messenger of the target (i.e.,
intracellular Ca2+, diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the
target on an appropriate substrate, detecting the induction of a reporter gene (comprising
a NgR-responsive regulatory element operatively linked to a nucleic acid encoding a
detectable marker, e.g., luciferase), or detecting a cellular response, for example, cell
survival, cellular differentiation, or cell proliferation.

(c) Competition experiments 

In yet another embodiment, the assay comprises contacting the NgR protein or
biologically active portion thereof with a known compound which binds NgR to form
an assay mixture, contacting the assay mixture with a test compound, and determining
the ability of the test compound to interact with a NgR protein, wherein determining
the ability of the test compound to interact with a NgR protein comprises determining
the ability of the test compound to preferentially bind to NgR or biologically active
portion thereof as compared to the known compound.
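
As a minimal sketch of how the read-out of such a competition experiment might be
reduced to a single number, the following Python function computes the percentage of
labeled known-compound binding that is displaced when a test compound is added. The
function name, the background parameter, and the signal units are illustrative
assumptions and are not taken from the specification.

```python
def percent_displacement(signal_known_only, signal_with_test, background=0.0):
    """Percent of labeled known-compound binding displaced by a test compound.

    All arguments are hypothetical read-outs (e.g., counts per minute) in the
    same units; `background` is the assumed non-specific signal.
    """
    specific_known = signal_known_only - background
    if specific_known <= 0:
        raise ValueError("known-compound signal must exceed background")
    specific_with_test = signal_with_test - background
    # 0% means the test compound displaced nothing; 100% means full displacement.
    return 100.0 * (1.0 - specific_with_test / specific_known)


if __name__ == "__main__":
    # Hypothetical numbers: 1100 units without and 350 units with the test compound.
    print(percent_displacement(1100, 350, background=100))
```

A high displacement value would be consistent with the test compound binding NgR in
preference to the known compound, under the assumption that both compete for the same
site.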

In yet another embodiment, the cell-free assay comprises contacting the NgR
protein or biologically active portion thereof with a known compound which binds
NgR to form an assay mixture, contacting the assay mixture with a test compound, and
determining the ability of the test compound to interact with a NgR protein, wherein
determining the ability of the test compound to interact with a NgR protein comprises
determining the ability of the NgR protein to modulate the activity of a NgR target
molecule.

The cell-free assays of the present invention are amenable to use of either the
soluble form or the membrane-bound form of NgR. In the case of cell-free assays
comprising the membrane-bound form of NgR, it may be desirable to utilize a
solubilizing agent such that the membrane-bound form of NgR is maintained in
solution. Examples of such solubilizing agents include non-ionic detergents such as
n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide,
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®,
Isotridecypoly(ethylene glycol ether)n, 3-[(3-cholamidopropyl)dimethylamminio]-1-
propane sulfonate (CHAPS), 3-[(3-cholamidopropyl)dimethylamminio]-2-hydroxy-1-
propane sulfonate (CHAPSO), or N-dodecyl-N,N-dimethyl-3-ammonio-1-propane
sulfonate.

Modulators 

Agents that modulate (i.e., increase, decrease, or block) NgR activity or
expression may be identified by incubating a putative modulator with a cell containing
a NgR polypeptide or polynucleotide and determining the effect of the putative
modulator on NgR activity or expression. The selectivity of a compound that
modulates the activity of NgR can be evaluated by comparing its effects on NgR to its
effect on other NgR compounds. Selective modulators may include, for example,
antibodies and other proteins, peptides or organic molecules which specifically bind to
a NgR polypeptide or a NgR-encoding nucleic acid. Modulators of NgR activity will
be therapeutically useful in treatment of diseases and physiological conditions in which
normal or aberrant NgR activity is involved. NgR polynucleotides, polypeptides and
modulators may be used in the treatment of such diseases and conditions associated
with demyelination. NgR polynucleotides and polypeptides, as well as NgR
modulators, may also be used in diagnostic assays for such diseases or conditions.

Methods of the invention to identify modulators include variations on any of
the methods described above to identify binding partner compounds, the variations
including techniques wherein a binding partner compound has been identified and the
binding assay is carried out in the presence and absence of a candidate modulator. A
modulator is identified in those instances where binding between the NgR polypeptide
and the binding partner compound changes in the presence of the candidate modulator
compared to binding in the absence of the candidate modulator compound. A
modulator that increases binding between the NgR polypeptide and the binding partner
compound is described as an enhancer or activator, and a modulator that decreases
binding between the NgR polypeptide and the binding partner compound is described
as an inhibitor.
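
The classification rule just described can be sketched in a few lines of Python. The
function name and the 10% tolerance band used to absorb measurement noise are
assumptions made for illustration only; the specification does not state a threshold.

```python
def classify_modulator(binding_without, binding_with, tolerance=0.10):
    """Label a candidate by its effect on NgR/binding-partner binding.

    `binding_without` and `binding_with` are hypothetical binding signals measured
    in the absence and presence of the candidate. `tolerance` is an assumed
    fractional change below which the candidate is treated as having no effect.
    """
    if binding_without <= 0:
        raise ValueError("reference binding must be positive")
    change = (binding_with - binding_without) / binding_without
    if change > tolerance:
        return "enhancer"
    if change < -tolerance:
        return "inhibitor"
    return "no effect"


if __name__ == "__main__":
    print(classify_modulator(100.0, 150.0))
    print(classify_modulator(100.0, 40.0))
```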

In another embodiment, modulators of NgR expression may be identified in a
method wherein a cell is contacted with a candidate compound and the expression of
NgR mRNA or protein in the cell is determined. The level of expression of NgR
mRNA or protein in the presence of the candidate compound is compared to the level
of expression of NgR mRNA or protein in the absence of the candidate compound.
The candidate compound can then be identified as a modulator of NgR expression
based on this comparison. For example, when expression of NgR mRNA or protein is
greater (statistically significantly greater) in the presence of the candidate compound
than in its absence, the candidate compound is identified as a stimulator of NgR mRNA
or protein expression. Alternatively, when expression of NgR mRNA or protein is less
(statistically significantly less) in the presence of the candidate compound than in its
absence, the candidate compound is identified as an inhibitor of NgR mRNA or protein
expression. The level of NgR mRNA or protein expression in the cells can be
determined by methods described herein for detecting NgR mRNA or protein.
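
The specification does not name a statistical test for the comparison above. As one
possible sketch, the following Python code uses an exact two-sided permutation test on
replicate expression measurements; the choice of test, the function names, and the
0.05 significance level are all assumptions made for illustration.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, control):
    """Exact two-sided permutation p-value for a difference in group means."""
    observed = abs(mean(treated) - mean(control))
    pooled = list(treated) + list(control)
    n_treated = len(treated)
    n_control = len(pooled) - n_treated
    total = sum(pooled)
    splits = 0
    at_least_as_extreme = 0
    # Enumerate every way of relabeling the pooled values as "treated".
    for indices in combinations(range(len(pooled)), n_treated):
        group_sum = sum(pooled[i] for i in indices)
        diff = abs(group_sum / n_treated - (total - group_sum) / n_control)
        splits += 1
        if diff >= observed - 1e-12:  # small slack for floating-point rounding
            at_least_as_extreme += 1
    return at_least_as_extreme / splits


def classify_expression(treated, control, alpha=0.05):
    """Return 'stimulator', 'inhibitor', or 'no significant change'."""
    if permutation_p_value(treated, control) >= alpha:
        return "no significant change"
    return "stimulator" if mean(treated) > mean(control) else "inhibitor"


if __name__ == "__main__":
    # Hypothetical replicate expression levels (arbitrary units).
    print(classify_expression([10, 11, 12, 13], [1, 2, 3, 4]))
```

Exhaustive enumeration is only practical for small replicate numbers; with larger
groups a sampled permutation test or a parametric test would be the usual substitute.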

High Throughput Screening 

The invention also comprehends high-throughput screening (HTS) assays to
identify compounds that interact with or inhibit biological activity (i.e., affect
enzymatic activity, binding activity, etc.) of a NgR polypeptide. HTS assays permit
screening of large numbers of compounds in an efficient manner. Cell-based HTS
systems are contemplated to investigate NgR receptor-ligand interaction. HTS assays
are designed to identify "hits" or "lead compounds" having the desired property, from
which modifications can be designed to improve the desired property. Chemical
modification of the "hit" or "lead compound" is often based on an identifiable
structure/activity relationship between the "hit" and the NgR polypeptide.

Another aspect of the present invention is directed to methods of identifying
compounds that bind to either NgR or nucleic acid molecules encoding NgR,
comprising contacting NgR, or a nucleic acid molecule encoding the same, with a
compound, and determining whether the compound binds NgR or a nucleic acid
molecule encoding the same. Binding can be determined by binding assays which are
well known to the skilled artisan, including, but not limited to, gel-shift assays,
Western blots, radiolabeled competition assay, phage-based expression cloning,
co-fractionation by chromatography, co-precipitation, cross linking, interaction
trap/two-hybrid analysis, southwestern analysis, ELISA, and the like, which are
described in, for example, Ausubel et al. (Eds.), CURRENT PROTOCOLS IN MOLECULAR
BIOLOGY, 1999, John Wiley & Sons, NY, which is incorporated herein by reference in
its entirety. The NgR proteins, for example, can be used as "bait proteins" in a
two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos et al.
(1993) Cell 72, 223-232; Madura et al. (1993) J. Biol. Chem. 268, 12046-12054;
Bartel et al. (1993) BioTechniques 14, 920-924; Iwabuchi et al. (1993) Oncogene 8,
1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact
with NgR ("NgR-binding proteins" or "NgR-bp") and modulate NgR activity. Such
NgR-binding proteins are also likely to be involved in the propagation of signals by the
NgR proteins as, for example, upstream or downstream elements of the NgR pathway.

Other assays may be used to identify specific ligands of a NgR receptor,
including assays that identify ligands of the target protein through measuring direct
binding of test ligands to the target protein, as well as assays that identify ligands of
target proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC
methods or other physical and analytical methods. Alternatively, such binding
interactions are evaluated indirectly using the yeast two-hybrid system described in
Fields et al. (1989) Nature 340, 245-246, and Fields et al. (1994) Trends Genet. 10,
286-292, both of which are incorporated herein by reference. The two-hybrid system
is a genetic assay, based on the modular nature of most transcription factors, used for
detecting interactions between two proteins or polypeptides. It can be used to identify
proteins that bind to a known protein of interest, or to delineate domains or residues
critical for an interaction. Variations on this methodology have been developed to
clone genes that encode DNA binding proteins, to identify peptides that bind to a
protein, and to screen for drugs. The two-hybrid system exploits the ability of a pair of
interacting proteins to bring a transcription activation domain into close proximity with
a DNA binding domain that binds to an upstream activation sequence (UAS) of a
reporter gene, and is generally performed in yeast. The assay requires the construction
of two hybrid genes encoding (1) a DNA-binding domain that is fused to a first protein
and (2) an activation domain fused to a second protein. The DNA-binding domain
targets the first hybrid protein to the UAS of the reporter gene; however, because most
proteins lack an activation domain, this DNA-binding hybrid protein does not activate
transcription of the reporter gene. The second hybrid protein, which contains the
activation domain, cannot by itself activate expression of the reporter gene because it
does not bind the UAS. However, when both hybrid proteins are present, the
noncovalent interaction of the first and second proteins tethers the activation domain
to the UAS, activating transcription of the reporter gene. For example, when the first
protein is a NgR gene product, or fragment thereof, that is known to interact with
another protein or nucleic acid, this assay can be used to detect agents that interfere
with the binding interaction. Expression of the reporter gene is monitored as different
test agents are added to the system. The presence of an inhibitory agent results in lack
of a reporter signal. The compounds to be screened include (and may include
compounds that are suspected to bind NgR, or a nucleic acid molecule encoding the
same), but are not limited to, compounds of extracellular, intracellular, biological or
chemical origin.
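
The reporter logic of the two-hybrid system described above can be summarized as a
simple truth function. The Python sketch below is an idealized Boolean model, not a
description of any laboratory protocol; the function and parameter names are
hypothetical, and it assumes that an inhibitory agent completely blocks the interaction
between the two hybrid proteins.

```python
def reporter_active(bait_binds_uas, prey_has_activation_domain,
                    bait_prey_interact, inhibitor_present=False):
    """Idealized two-hybrid read-out.

    The reporter is transcribed only when the DNA-binding hybrid sits on the UAS,
    the second hybrid carries an activation domain, and the two hybrids interact
    so that the activation domain is tethered to the UAS.
    """
    tethered = bait_prey_interact and not inhibitor_present
    return bait_binds_uas and prey_has_activation_domain and tethered


if __name__ == "__main__":
    print(reporter_active(True, True, True))                          # interaction intact
    print(reporter_active(True, True, True, inhibitor_present=True))  # interaction blocked
```

In this simplified picture, loss of the reporter signal after a test agent is added is
what flags the agent as a candidate inhibitor of the binding interaction.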

The function of the NgR gene product is unclear and no ligands have yet been
found which bind the gene product. The yeast two-hybrid assay is useful to identify
proteins that bind to the gene product. In an assay to identify proteins that bind to a
NgR receptor, or fragment thereof, a fusion polynucleotide encoding both a NgR
receptor (or fragment) and a UAS binding domain (i.e., a first protein) may be used.
In addition, a large number of hybrid genes each encoding a different second protein
fused to an activation domain are produced and screened in the assay. Typically, the
second protein is encoded by one or more members of a total cDNA or genomic DNA
fusion library, with each second protein-coding region being fused to the activation
domain. This system is applicable to a wide variety of proteins, and it is not even
necessary to know the identity or function of the second binding protein. The system
is highly sensitive and can detect interactions not revealed by other methods; even
transient interactions may trigger transcription to produce a stable mRNA that can be
repeatedly translated to yield the reporter protein.

Other assays may be used to search for agents that bind to the target protein.
One such screening method to identify direct binding of test ligands to a target protein
is described in U.S. Patent No. 5,585,277, incorporated herein by reference. This
method relies on the principle that proteins generally exist as a mixture of folded and
unfolded states, and continually alternate between the two states. When a test ligand
binds to the folded form of a target protein (i.e., when the test ligand is a ligand of the
target protein), the target protein molecule bound by the ligand remains in its folded
state. Thus, the folded target protein is present to a greater extent in the presence of a
test ligand which binds the target protein, than in the absence of a ligand. Binding of
the ligand to the target protein can be determined by any method which distinguishes
between the folded and unfolded states of the target protein. The function of the
target protein need not be known in order for this assay to be performed. Virtually any
agent can be assessed by this method as a test ligand, including, but not limited to,
metals, polypeptides, proteins, lipids, polysaccharides, polynucleotides and small
organic molecules.
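
The principle stated above, that a ligand binding only the folded state shifts the
population toward that state, can be illustrated with a textbook two-state equilibrium
model. The Python sketch below is not taken from U.S. Patent No. 5,585,277; it assumes
a folding equilibrium constant K = [folded]/[unfolded], a single binding site with
dissociation constant Kd, and free ligand in excess so that its concentration is
effectively constant. All names and numbers are hypothetical.

```python
def fraction_folded(k_fold, ligand_conc=0.0, kd=1.0):
    """Fraction of target protein in the folded state (free plus ligand-bound).

    Assumed two-state model: unfolded <-> folded with constant `k_fold`, and the
    ligand binds only the folded state with dissociation constant `kd`
    (same concentration units as `ligand_conc`).
    """
    if k_fold < 0 or ligand_conc < 0 or kd <= 0:
        raise ValueError("k_fold and ligand_conc must be >= 0 and kd must be > 0")
    # Statistical weight of all folded species relative to the unfolded state.
    folded_weight = k_fold * (1.0 + ligand_conc / kd)
    return folded_weight / (1.0 + folded_weight)


if __name__ == "__main__":
    for conc in (0.0, 1.0, 10.0):
        print(conc, fraction_folded(k_fold=1.0, ligand_conc=conc, kd=1.0))
```

Under these assumptions the folded fraction rises monotonically with ligand
concentration, which is the shift that any folded-versus-unfolded read-out would be
expected to detect.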

Another method for identifying ligands of a target protein is described in
Wieboldt et al. (1997) Anal. Chem. 69:1683-1691, incorporated herein by reference.
This technique screens combinatorial libraries of 20-30 agents at a time in solution
phase for binding to the target protein. Agents that bind to the target protein are
separated from other library components by simple membrane washing. The
specifically selected molecules that are retained on the filter are subsequently liberated
from the target protein and analyzed by HPLC and pneumatically assisted electrospray
(ion spray) ionization mass spectroscopy. This procedure selects library components
with the greatest affinity for the target protein, and is particularly useful for small
molecule libraries.

The methods of the invention also embrace ligands, especially neuropeptides,
that are attached to a label, such as a radiolabel (e.g., 35S, 32P), a
fluorescence label, a chemiluminescent label, an enzymic label and an immunogenic
label. Modulators falling within the scope of the invention include, but are not limited
to, non-peptide molecules such as non-peptide mimetics, non-peptide allosteric
effectors, and peptides. The NgR polypeptide or polynucleotide employed in such a
test may either be free in solution, attached to a solid support, borne on a cell surface
or located intracellularly or associated with a portion of a cell. One skilled in the art
can, for example, measure the formation of complexes between NgR and the
compound being tested. Alternatively, one skilled in the art can examine the
diminution in complex formation between NgR and its substrate caused by the
compound being tested.

Another aspect of the present invention is directed to methods of identifying
compounds which modulate (i.e., increase or decrease) activity of NgR comprising
contacting NgR with a compound, and determining whether the compound modifies
activity of NgR. The activity in the presence of the test compound is compared to the
activity in the absence of the test compound. Where the activity of the sample
containing the test compound is higher than the activity in the sample lacking the test
compound, the compound will have increased activity. Similarly, where the activity of
the sample containing the test compound is lower than the activity in the sample
lacking the test compound, the compound will have inhibited activity.

The present invention is particularly useful for screening compounds by using
NgR in any of a variety of drug screening techniques. The compounds to be screened
include (and may include compounds which are suspected to modulate NgR
activity), but are not limited to, compounds of extracellular, intracellular, biologic or
chemical origin. The NgR polypeptide employed in such a test may be in any form,
preferably, free in solution, attached to a solid support, borne on a cell surface or
located intracellularly. One skilled in the art can, for example, measure the formation
of complexes between NgR and the compound being tested. Alternatively, one skilled
in the art can examine the diminution in complex formation between Nogo-R and its
substrate caused by the compound being tested.

The activity of NgR polypeptides of the invention can be determined by, for
example, examining the ability to bind or be activated by chemically synthesized
peptide ligands. Alternatively, the activity of the NgR can be assayed by examining
its ability to bind calcium ions, hormones, chemokines, neuropeptides,
neurotransmitters, nucleotides, lipids, odorants and photons. Alternatively, the activity
of the NgR can be determined by examining the activity of effector molecules
including, but not limited to, adenylate cyclase, phospholipases and ion channels.
Thus, modulators of NgR activity may alter a NgR receptor function, such as a binding
property of a receptor or an activity. In various embodiments of the method, the assay
may take the form of an ion flux assay, a yeast growth assay, a non-hydrolyzable GTP
assay such as a [35S]-GTPγS assay, a cAMP assay, an inositol triphosphate assay, a
diacylglycerol assay, an Aequorin assay, a Luciferase assay, a FLIPR assay for
intracellular Ca2+ concentration, a mitogenesis assay, a MAP Kinase activity assay, an
arachidonic acid release assay (e.g., using [3H]-arachidonic acid) and an assay for
extracellular acidification rates, as well as other binding or function-based assays of
NgR activity that are generally known in the art. NgR activity can be determined by
methodologies that are used to assay for FaRP activity, which is well known to those
skilled in the art. Biological activities of NgR receptors according to the invention
include, but are not limited to, the binding of a natural or an unnatural ligand, as well
as any one of the functional activities of NgRs known in the art. Non-limiting
examples of NgR activities include transmembrane signaling of various forms, which
may involve phosphatidylinositol (PI) association and/or the exertion of an influence
over PI; another exemplary activity of NgRs is the binding of accessory proteins or
polypeptides that differ from known GPI proteins.

The modulators of the invention exhibit a variety of chemical structures, which
can be generally grouped into non-peptide mimetics of natural NgR receptor ligands,
peptide and non-peptide allosteric effectors of NgR receptors, and peptides that may
function as activators or inhibitors (competitive, uncompetitive and non-competitive)
(e.g., antibody products) of NgR receptors. The invention does not restrict the
sources for suitable modulators, which may be obtained from natural sources such as
plant, animal or mineral extracts, or non-natural sources such as small molecule
libraries, including the products of combinatorial chemical approaches to library
construction, and peptide libraries.

Other assays can be used to examine enzymatic activity including, but not
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are
described in, for example, ENZYME ASSAYS: A PRACTICAL APPROACH, Eisenthal and
Danson (Eds.), 1992, Oxford University Press, which is incorporated herein by
reference in its entirety.

The use of cDNAs in drug discovery programs is well-known; assays capable
of testing thousands of unknown compounds per day in high-throughput screens
(HTSs) are thoroughly documented. The literature is replete with examples of the use
of radiolabelled ligands in HTS binding assays for drug discovery (see Williams (1991)
Med. Res. Rev. 11, 147-184; Sweetnam et al. (1993) J. Nat. Prod. 56, 441-455 for
review). Recombinant receptors are preferred for binding assay HTS because they
allow for better specificity (higher relative purity), provide the ability to generate large
amounts of receptor material, and can be used in a broad variety of formats (see
Hodgson (1992) Bio/Technology 10, 973-980; each of which is incorporated herein by
reference in its entirety).

A variety of heterologous systems is available for functional expression of
recombinant receptors that are well known to those skilled in the art. Such systems
include bacteria (Strosberg et al. (1992) Trends Pharmacol. Sci. 13, 95-98), yeast
(Pausch (1997) Trends Biotechnol. 15, 487-494), several kinds of insect cells (Vanden
Broeck (1996) Int. Rev. Cytol. 164, 189-268), amphibian cells (Jayawickreme et al.
(1997) Curr. Opin. Biotechnol. 8, 629-634) and several mammalian cell lines (CHO,
HEK293, COS, etc.; see Gerhardt et al. (1997) Eur. J. Pharmacol. 334, 1-23). These
examples do not preclude the use of other possible cell expression systems, including
cell lines obtained from nematodes (PCT application WO 98/37177).

In preferred embodiments of the invention, methods of screening for
compounds which modulate NgR activity comprise contacting test compounds with
NgR and assaying for the presence of a complex between the compound and NgR. In
such assays, the ligand is typically labeled. After suitable incubation, free ligand is
separated from that present in bound form, and the amount of free or uncomplexed
label is a measure of the ability of the particular compound to bind to NgR.

In another embodiment of the invention, high throughput screening for
compounds having suitable binding affinity to NgR is employed. Briefly, large
numbers of different small peptide test compounds are synthesized on a solid substrate.
The peptide test compounds are contacted with NgR and washed. Bound NgR is then
detected by methods well known in the art. Purified polypeptides of the invention can
also be coated directly onto plates for use in the aforementioned drug screening
techniques. In addition, non-neutralizing antibodies can be used to capture the protein
and immobilize it on the solid support.

Generally, an expressed NgR can be used for HTS binding assays in
conjunction with its defined ligand. The identified peptide is labeled with a suitable
radioisotope, including, but not limited to, 125I, 3H, 35S or 32P, by methods that are
well known to those skilled in the art. Alternatively, the peptides may be labeled by
well-known methods with a suitable fluorescent derivative (Baindur et al. (1994) Drug
Dev. Res. 33, 373-398; Rogers (1997) Drug Discov. Today 2, 156-160). Radioactive
ligand specifically bound to the receptor in membrane preparations made from the cell
line expressing the recombinant protein can be detected in HTS assays in one of
several standard ways, including filtration of the receptor-ligand complex to separate
bound ligand from unbound ligand (Williams (1991) Med. Res. Rev. 11, 147-184;
Sweetnam et al. (1993) J. Nat. Prod. 56, 441-455). Alternative methods include a
scintillation proximity assay (SPA) or a FlashPlate format in which such separation is
unnecessary (Nakayama (1998) Curr. Opin. Drug Disc. Dev. 1, 85-91; Bossé et al.
(1998) J. Biomol. Screening 3, 285-292). Binding of fluorescent ligands can be
detected in various ways, including fluorescence energy transfer (FRET), direct
spectrophotofluorometric analysis of bound ligand, or fluorescence polarization
(Rogers (1997) Drug Discov. Today 2, 156-160; Hill (1998) Curr. Opin. Drug Disc.
Dev. 1, 92-97).

Examples of such biological responses include, but are not limited to, the
following: the ability to survive in the absence of a limiting nutrient in specifically
engineered yeast cells (Pausch (1997) Trends Biotechnol. 15, 487-494); changes in
intracellular Ca2+ concentration as measured by fluorescent dyes (Murphy et al. (1998)
Curr. Opin. Drug Disc. Dev. 1, 192-199). Fluorescence changes can also be used to
monitor ligand-induced changes in membrane potential or intracellular pH; an
automated system suitable for HTS has been described for these purposes (Schroeder
et al. (1996) J. Biomol. Screening 1, 75-80). Melanophores prepared from Xenopus
laevis show a ligand-dependent change in pigment organization in response to
heterologous NgR activation; this response is adaptable to HTS formats
(Jayawickreme et al. (1997) Curr. Opin. Biotechnol. 8, 629-634). Assays are also
available for the measurement of common second messengers, including cAMP,
phosphoinositides and arachidonic acid, but these are not generally preferred for HTS.

Preferred methods of HTS employing these receptors include permanently
transfected CHO cells, in which agonists and antagonists can be identified by the ability
to transduce the signal for the binding of Nogo in membranes prepared from these cells
through the putative GPI anchor. In another embodiment of the invention,
permanently transfected CHO cells could be used for the preparation of membranes
which contain significant amounts of the recombinant receptor proteins; these
membrane preparations would then be used in receptor binding assays, employing the
radiolabelled ligand specific for the particular receptor. Alternatively, a functional
assay, such as fluorescent monitoring of ligand-induced changes in internal Ca2+
concentration or membrane potential in permanently transfected CHO cells containing
each of these receptors individually or in combination would be preferred for HTS.
Equally preferred would be an alternative type of mammalian cell, such as HEK293 or
COS cells, in similar formats. More preferred would be permanently transfected insect
cell lines, such as Drosophila S2 cells. Even more preferred would be recombinant
yeast cells expressing the Drosophila melanogaster receptors in HTS formats well
known to those skilled in the art (e.g., Pausch (1997), above).

The invention contemplates a multitude of assays to screen and identify
inhibitors of ligand binding to NgR receptors. In one example, the NgR receptor is
immobilized and interaction with a binding partner is assessed in the presence and
absence of a candidate modulator such as an inhibitor compound. In another example,
interaction between the NgR receptor and its binding partner is assessed in a solution
assay, both in the presence and absence of a candidate inhibitor compound. In either
assay, an inhibitor is identified as a compound that decreases binding between the NgR
receptor and its binding partner. Another contemplated assay involves a variation of
the di-hybrid assay wherein an inhibitor of protein/protein interactions is identified by
detection of a positive signal in a transformed or transfected host cell, as described in
PCT publication number WO 95/20652, published August 3, 1995.
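
By way of illustration only, the comparison of binding in the presence and absence of
a candidate inhibitor can be sketched as a percent-inhibition calculation. The function
names, the background-subtraction step and the 50% hit threshold below are
illustrative assumptions and are not taken from this specification.

```python
def percent_inhibition(signal_with_compound, signal_without_compound, background=0.0):
    """Percent reduction in specific binding caused by a candidate compound.

    All three arguments are raw assay readouts in the same units
    (e.g., counts per minute). `background` is the non-specific signal
    and is an assumed correction, not one prescribed by the text.
    """
    specific_control = signal_without_compound - background
    if specific_control <= 0:
        raise ValueError("control signal must exceed background")
    specific_test = signal_with_compound - background
    return 100.0 * (1.0 - specific_test / specific_control)


def is_inhibitor(signal_with_compound, signal_without_compound,
                 background=0.0, threshold_percent=50.0):
    """Flag a compound that decreases binding by at least the (assumed) threshold."""
    inhibition = percent_inhibition(
        signal_with_compound, signal_without_compound, background
    )
    return inhibition >= threshold_percent
```

A compound that lowers the background-corrected signal from 1000 to 200 units would
score 80% inhibition under this sketch and would be flagged at the assumed threshold.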

Candidate modulators contemplated by the invention include compounds
selected from libraries of either potential activators or potential inhibitors. There are a
number of different libraries used for the identification of small molecule modulators,
including: (1) chemical libraries, (2) natural product libraries, and (3) combinatorial
libraries comprised of random peptides, oligonucleotides or organic molecules.
Chemical libraries consist of random chemical structures, some of which are analogs of
known compounds or analogs of compounds that have been identified as "hits" or
"leads" in other drug discovery screens, some of which are derived from natural
products, and some of which arise from non-directed synthetic organic chemistry.
Natural product libraries are collections of microorganisms, animals, plants, or marine
organisms that are used to create mixtures for screening by: (1) fermentation and
extraction of broths from soil, plant or marine microorganisms or (2) extraction of
plants or marine organisms. Natural product libraries include polyketides,
non-ribosomal peptides, and variants (non-naturally occurring) thereof. For a review,
see Cane et al., Science (1998) 282, 63-68. Combinatorial libraries are composed of
large numbers of peptides, oligonucleotides, or organic compounds as a mixture.
These libraries are relatively easy to prepare by traditional automated synthesis
methods, PCR, cloning, or proprietary synthetic methods. Of particular interest are
non-peptide combinatorial libraries. Still other libraries of interest include peptide,
protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and
polypeptide libraries. For a review of combinatorial chemistry and libraries created
therefrom, see Myers (1997) Curr. Opin. Biotechnol. 8, 701-707. Identification of
modulators through use of the various libraries described herein permits modification
of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to modulate
activity.

Still other candidate inhibitors contemplated by the invention can be designed
and include soluble forms of binding partners, as well as such binding partners as
chimeric, or fusion, proteins. A "binding partner" as used herein broadly encompasses
non-peptide modulators, as well as such peptide modulators as neuropeptides other
than natural ligands, antibodies, antibody fragments, and modified compounds
comprising antibody domains that are immunospecific for the expression product of the
identified NgR gene.

Other embodiments of the invention comprise using competitive screening
assays in which neutralizing antibodies capable of binding a polypeptide of the
invention specifically compete with a test compound for binding to the polypeptide. In
this manner, the antibodies can be used to detect the presence of any peptide that
shares one or more antigenic determinants with NgR. Radiolabeled competitive
binding studies are described in Lin et al. (1997) Antimicrob. Agents Chemother. 41,
2127-2131, the disclosure of which is incorporated herein by reference in its entirety.

In other embodiments of the invention, the polypeptides of the invention are
employed as a research tool for identification, characterization and purification of
interacting, regulatory proteins. Appropriate labels are incorporated into the
polypeptides of the invention by various methods known in the art and the
polypeptides are used to capture interacting molecules. For example, molecules are
incubated with the labeled polypeptides, washed to remove unbound polypeptides, and
the polypeptide complex is quantified. Data obtained using different concentrations of
polypeptide are used to calculate values for the number, affinity, and association of
polypeptide with the protein complex.
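
As a non-limiting sketch of how the number of binding sites and the affinity might be
estimated from data obtained at different polypeptide concentrations, the following
Python function applies a Scatchard transformation. The one-site binding model, the
function name and the use of unweighted least squares are assumptions made for
illustration; the specification does not prescribe a particular analysis.

```python
def scatchard_fit(free, bound):
    """Estimate (Kd, Bmax) from paired free and bound concentrations.

    Assumes a single class of non-interacting sites, for which
    bound/free = Bmax/Kd - bound/Kd. A straight line is fitted to
    (bound, bound/free); the slope is -1/Kd and the intercept is Bmax/Kd.
    """
    if len(free) != len(bound) or len(free) < 2:
        raise ValueError("need at least two paired measurements")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or sxy == 0:
        raise ValueError("data do not define a sloped Scatchard line")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope          # dissociation constant (affinity)
    bmax = intercept * kd      # total number of binding sites
    return kd, bmax
```

In practice, non-linear regression on the untransformed data is often preferred because
the Scatchard transformation distorts measurement error; the linear form is shown
here only because it makes the relationship between the quantities explicit.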

Labeled polypeptides are also useful as reagents for the purification of
molecules with which the polypeptide interacts including, but not limited to, inhibitors.
In one embodiment of affinity purification, a polypeptide is covalently coupled to a
chromatography column. Cells and their membranes are extracted, and various cellular
subcomponents are passed over the column. Molecules bind to the column by virtue
of their affinity to the polypeptide. The polypeptide-complex is recovered from the
column, dissociated and the recovered molecule is subjected to protein sequencing.
This amino acid sequence is then used to identify the captured molecule or to design
degenerate oligonucleotides for cloning the corresponding gene from an appropriate
cDNA library.

Alternatively, compounds may be identified which exhibit similar properties to
the ligand for the NgR of the invention, but which are smaller and exhibit a longer
half-life than the endogenous ligand in a human or animal body. When an organic
compound is designed, a molecule according to the invention is used as a "lead"
compound. The design of mimetics to known pharmaceutically active compounds is a
well-known approach in the development of pharmaceuticals based on such "lead"
compounds. Mimetic design, synthesis and testing are generally used to avoid
randomly screening a large number of molecules for a target property. Furthermore,
structural data deriving from the analysis of the deduced amino acid sequences
encoded by the DNAs of the present invention are useful to design new drugs that are
more specific and therefore have a higher pharmacological potency.

Comparison of the protein sequence of the present invention with the
sequences present in all the available databases showed a significant homology with the
transmembrane portion of G protein coupled receptors. Accordingly, computer
modeling can be used to develop a putative tertiary structure of the proteins of the
invention based on the available information of the transmembrane domain of other
proteins. Thus, novel ligands based on the predicted structure of NgR can be
designed.

This invention further pertains to novel agents identified by the above-described
screening assays and uses thereof for treatments as described herein.

Compositions and Pharmaceutical Compositions

In a particular embodiment, the novel molecules identified by the screening
methods according to the invention are low molecular weight organic molecules, in
which case a composition or pharmaceutical composition can be prepared thereof for
oral or parenteral administration. The compositions, or pharmaceutical compositions,
comprising the nucleic acid molecules, vectors, polypeptides, antibodies and
compounds identified by the screening methods described herein, typically comprise
the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable
carrier. As used herein, "pharmaceutically acceptable carrier" is intended to include
any and all solvents, dispersion media, coatings, antibacterial and antifungal agents,
isotonic and absorption delaying agents, and the like, compatible with pharmaceutical
administration. The nature of the carrier or other ingredients will depend on the
specific route of administration and particular embodiment of the invention to be
administered. Examples of techniques and protocols that are useful in this context are,
inter alia, found in Remington's Pharmaceutical Sciences, 16th ed., (1980) Osol,
A. (Ed.), which is incorporated herein by reference in its entirety. Preferred examples
of such carriers or diluents include, but are not limited to, water, saline, Ringer's
solution, dextrose solution and 5% human serum albumin. Liposomes and
non-aqueous vehicles such as fixed oils may also be used. The use of such media and
agents for pharmaceutically active substances is well known in the art. Except insofar
as any conventional media or agent is incompatible with the active compound, use
thereof in the compositions is contemplated. Supplementary active compounds can
also be incorporated into the compositions.

A pharmaceutical composition of the invention is formulated to be compatible
with its intended route of administration. Examples of routes of administration include
oral and parenteral (e.g., intravenous, intradermal, subcutaneous, inhalation,
transdermal (topical), transmucosal and rectal administration). Solutions or
suspensions used for parenteral, intradermal or subcutaneous application can include
the following components: a sterile diluent such as water for injection, saline solution,
fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents;
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic
acid; buffers such as acetates, citrates or phosphates; and agents for the adjustment of
tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or
bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can
be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or
plastic.

Pharmaceutical compositions suitable for injectable use include sterile aqueous
solutions (where water soluble) or dispersions and sterile powders for the
extemporaneous preparation of sterile injectable solutions or dispersion. For
intravenous administration, suitable carriers include physiological saline, bacteriostatic
water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS).
In all cases, the composition must be sterile and should be fluid to the extent that easy
syringeability exists. It must be stable under the conditions of manufacture and storage
and must be preserved against the contaminating action of microorganisms such as
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for
example, water, ethanol, polyol (for example, glycerol, propylene glycol and liquid
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity
can be maintained, for example, by the use of a coating such as lecithin, by the
maintenance of the required particle size in the case of dispersion and by the use of
surfactants. Prevention of the action of microorganisms can be achieved by various
antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol,
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include
isotonic agents, for example, sugars, polyalcohols such as mannitol and sorbitol, or
sodium chloride in the composition. Prolonged absorption of the injectable
compositions can be brought about by including in the composition an agent which
delays absorption, for example, aluminum monostearate and gelatin.

Sterile injectable solutions can be prepared by incorporating the active
compound (e.g., a NgR protein or anti-NgR antibody) in the required amount in an
appropriate solvent with one or a combination of ingredients enumerated above, as
required, followed by filtered sterilization. Generally, dispersions are prepared by
incorporating the active compound into a sterile vehicle that contains a basic dispersion
medium and the required other ingredients from those enumerated above. In the case
of sterile powders for the preparation of sterile injectable solutions, methods of
preparation are vacuum drying and freeze-drying, which yield a powder of the active
ingredient plus any additional desired ingredient from a previously sterile-filtered
solution thereof.

Oral compositions generally include an inert diluent or an edible carrier. They
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral
therapeutic administration, the active compound can be incorporated with excipients
and used in the form of tablets, troches or capsules. Oral compositions can also be
prepared using a fluid carrier for use as a mouthwash, wherein the compound in the
fluid carrier is applied orally and swished and expectorated or swallowed.
Pharmaceutically compatible binding agents, and/or adjuvant materials can be included
as part of the composition. The tablets, pills, capsules, troches and the like can contain
any of the following ingredients, or compounds of a similar nature: a binder such as
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or
lactose; a disintegrating agent such as alginic acid, Primogel or corn starch; a lubricant
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a
sweetening agent such as sucrose or saccharin; or a flavoring agent such as
peppermint, methyl salicylate or orange flavoring.

For administration by inhalation, the compounds are delivered in the form of an
aerosol spray from a pressurized container or dispenser which contains a suitable
propellant, e.g., a gas such as carbon dioxide, or from a nebulizer.

Systemic administration can also be by transmucosal or transdermal means.
For transmucosal or transdermal administration, penetrants appropriate to the barrier
to be permeated are used in the formulation. Such penetrants are generally known in
the art, and include, for example, for transmucosal administration, detergents, bile
salts, and fusidic acid derivatives. Transmucosal administration can be accomplished
through the use of nasal sprays or suppositories. For transdermal administration, the
active compounds are formulated into ointments, salves, gels or creams as generally
known in the art.

The compounds can also be prepared in the form of suppositories (e.g., with
conventional suppository bases such as cocoa butter and other glycerides) or retention
enemas for rectal delivery.

In one embodiment, the active compounds are prepared with carriers that will
protect the compound against rapid elimination from the body, such as a controlled
release formulation, including implants and microencapsulated delivery systems.
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate,
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid.
Methods for preparation of such formulations will be apparent to those skilled in the
art. The materials can also be obtained commercially from Alza Corporation and Nova
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically
acceptable carriers. These can be prepared according to methods known to those
skilled in the art, for example, as described in U.S. Patent No. 4,522,811. It is
especially advantageous to formulate oral or parenteral compositions in dosage unit
form for ease of administration and uniformity of dosage. Dosage unit form as used
herein refers to physically discrete units suited as unitary dosages for the subject to be
treated, each unit containing a predetermined quantity of active compound calculated
to produce the desired therapeutic effect in association with the required
pharmaceutical carrier. The specification for the dosage unit forms of the invention is
dictated by and directly dependent on the unique characteristics of the active
compound and the particular therapeutic effect to be achieved.

The nucleic acid molecules of the invention can be inserted into vectors and
used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by
any of a number of routes, e.g., as described in U.S. Patent No. 5,703,055. Delivery
can thus also include, e.g., intravenous injection, local administration (see U.S. Patent
No. 5,328,470) or stereotactic injection (see, e.g., Chen et al. (1994) Proc. Natl. Acad.
Sci. USA 91, 3054-3057). The pharmaceutical preparation of the gene therapy vector
can include the gene therapy vector in an acceptable diluent, or can comprise a slow
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the
complete gene delivery vector can be produced intact from recombinant cells, e.g.,
retroviral vectors, the pharmaceutical preparation can include one or more cells that
produce the gene delivery system.

The pharmaceutical compositions can be included in a container, pack or
dispenser together with instructions for administration.

The dosage of these low molecular weight compounds will depend on the
disease state or condition to be treated and other clinical factors such as weight and
condition of the human or animal and the route of administration of the compound.
For treating humans or animals, between approximately 0.5 mg/kg and 500 mg/kg of
body weight of the compound can be administered. Therapy is typically
administered at lower dosages and is continued until the desired therapeutic outcome is
observed.
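
Purely as an arithmetic illustration of the weight-based range stated above, the
following sketch converts the 0.5 mg/kg to 500 mg/kg bounds into absolute amounts
for a given body weight. The function and constant names are hypothetical, and the
sketch is not a dosing recommendation; the clinical factors discussed above are not
modeled.

```python
# Bounds taken from the range stated in the text (mg per kg of body weight).
MIN_DOSE_MG_PER_KG = 0.5
MAX_DOSE_MG_PER_KG = 500.0


def dose_range_mg(body_weight_kg):
    """Return (lowest, highest) compound amount in mg for a body weight in kg."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (MIN_DOSE_MG_PER_KG * body_weight_kg,
            MAX_DOSE_MG_PER_KG * body_weight_kg)
```

For a 70 kg subject, this gives a range of 35 mg to 35,000 mg, which shows how wide
the stated bounds are and why therapy is typically started at the lower end.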

Another aspect of the present invention is the use of the NgR nucleotide
sequences disclosed herein for identifying homologs of the Nogo-R in other animals,
including but not limited to humans and other mammals and invertebrates. Any of the
nucleotide sequences disclosed herein, or any portion thereof, can be used, for
example, as probes to screen databases or nucleic acid libraries, such as, for example,
genomic or cDNA libraries, to identify homologs using screening procedures well
known to those skilled in the art. Accordingly, homologs having at least 50%, more
preferably at least 60%, more preferably at least 70%, more preferably at least 80%,
more preferably at least 90%, more preferably at least 95%, and most preferably
100% homology with NgR sequences can be identified.
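
By way of example only, the percentage thresholds recited above can be applied to a
candidate sequence as follows. This sketch treats "homology" as percent identity over
a pre-computed pairwise alignment of equal length, with "-" marking gaps; that
definition, and the function names, are assumptions made for illustration, since the
specification does not fix a particular scoring method here.

```python
# Thresholds recited in the text, in ascending order (percent).
HOMOLOGY_TIERS = (50, 60, 70, 80, 90, 95, 100)


def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same non-gap residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def highest_tier_met(identity_percent):
    """Return the highest recited threshold reached, or None if below 50%."""
    met = [tier for tier in HOMOLOGY_TIERS if identity_percent >= tier]
    return met[-1] if met else None
```

Two aligned eight-residue sequences that differ at a single position are 87.5%
identical under this definition and therefore meet the "at least 80%" threshold but
not the "at least 90%" threshold.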

The present compounds and methods, including nucleic acid molecules,
polypeptides, antibodies, compounds identified by the screening methods described
herein, have a variety of pharmaceutical applications and may be used, for example, to
treat or prevent unregulated cellular growth, such as cancer cell and tumor growth. In
a particular embodiment, the present molecules are used in gene therapy. For a review
of gene therapy procedures, see, e.g., Anderson (1992) Science 256, 808-813, which is
incorporated herein by reference in its entirety.

The present invention also encompasses a method of agonizing (stimulating) or
antagonizing a NgR natural binding partner associated activity in a mammal comprising
administering to said mammal an agonist or antagonist to one of the above disclosed
polypeptides in an amount sufficient to effect said agonism or antagonism. One
embodiment of the present invention, then, is a method of treating diseases in a
mammal with an agonist or antagonist of the protein of the present invention
comprising administering the agonist or antagonist to a mammal in an amount
sufficient to agonize or antagonize NgR-associated functions.

Methods of determining the dosages of compounds to be administered to a
patient and modes of administering compounds to an organism are disclosed in U.S.
Application Serial No. 08/702,282, filed August 23, 1996, and International patent
publication number WO 96/22976, published August 1, 1996, both of which are
incorporated herein by reference in their entirety, including any drawings, figures or
tables. Those skilled in the art will appreciate that such descriptions are applicable to
the present invention and can be easily adapted to it.

The proper dosage depends on various factors such as the type of disease being
treated, the particular composition being used and the size and physiological condition
of the patient. Therapeutically effective doses for the compounds described herein can
be estimated initially from cell culture and animal models. For example, a dose can be
formulated in animal models to achieve a circulating concentration range that initially
takes into account the IC50 as determined in cell culture assays. The animal model data
can be used to more accurately determine useful doses in humans.
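
As one hedged illustration of how an IC50 might be read off cell culture data, the
sketch below interpolates on a logarithmic concentration scale between the two tested
concentrations that bracket 50% residual activity. The function name, the input
format and the choice of log-linear interpolation (rather than a fitted sigmoidal
dose-response curve) are assumptions for illustration only.

```python
import math


def estimate_ic50(concentrations, percent_activity):
    """Estimate the concentration giving 50% residual activity.

    `concentrations` are positive values in any consistent unit;
    `percent_activity` holds the matching residual activity (0-100)
    relative to an untreated control. Activity is assumed to fall as
    concentration rises.
    """
    pairs = sorted(zip(concentrations, percent_activity))
    for (c_lo, a_lo), (c_hi, a_hi) in zip(pairs, pairs[1:]):
        if a_lo >= 50.0 >= a_hi and a_lo != a_hi:
            # Fraction of the way from the lower to the higher concentration.
            fraction = (a_lo - 50.0) / (a_lo - a_hi)
            log_ic50 = math.log10(c_lo) + fraction * (
                math.log10(c_hi) - math.log10(c_lo)
            )
            return 10.0 ** log_ic50
    raise ValueError("50% activity is not bracketed by the data")
```

With residual activities of 75% at 1 unit and 25% at 100 units, the interpolated IC50
is 10 units, the geometric midpoint of the two tested concentrations.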

Plasma half-life and biodistribution of the drug and metabolites in the plasma,
tumors and major organs can also be determined to facilitate the selection of drugs
most appropriate to inhibit a disorder. Such measurements can be carried out. For
example, HPLC analysis can be performed on the plasma of animals treated with the
drug and the location of radiolabeled compounds can be determined using detection
methods such as X-ray, CAT scan and MRI. Compounds that show potent inhibitory
activity in the screening assays, but have poor pharmacokinetic characteristics, can be
optimized by altering the chemical structure and retesting. In this regard, compounds
displaying good pharmacokinetic characteristics can be used as a model.

Toxicity studies can also be carried out by measuring the blood cell
composition. For example, toxicity studies can be carried out in a suitable animal
model as follows: (1) the compound is administered to mice (an untreated control
mouse should also be used); (2) blood samples are periodically obtained via the tail
vein from one mouse in each treatment group; and (3) the samples are analyzed for red
and white blood cell counts, blood cell composition and the percent of lymphocytes
versus polymorphonuclear cells. A comparison of results for each dosing regime with
the controls indicates if toxicity is present.

At the termination of each toxicity study, further studies can be carried out by
sacrificing the animals (preferably, in accordance with the American Veterinary
Medical Association guidelines, Report of the American Veterinary Medical Assoc.
Panel on Euthanasia (1993) J. Am. Vet. Med. Assoc. 202:229-249). Representative
animals from each treatment group can then be examined by gross necropsy for
immediate evidence of metastasis, unusual illness or toxicity. Gross abnormalities in
tissue are noted and tissues are examined histologically. Compounds causing a
reduction in body weight or blood components are less preferred, as are compounds
having an adverse effect on major organs. In general, the greater the adverse effect the
less preferred the compound.

For the treatment of cancers the expected daily dose of a hydrophobic
pharmaceutical agent is between 1 and 500 mg/day, preferably 1 to 250 mg/day, and
most preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided
plasma levels of the active moiety are sufficient to maintain therapeutic effectiveness.
Plasma levels should reflect the potency of the drug. Generally, the more potent the
compound the lower the plasma levels necessary to achieve efficacy.

NgR mRNA transcripts have been found in the brain and heart. SEQ ID NOs:
1 and/or 3 will, as detailed above, enable screening for the endogenous
neurotransmitters/hormones/ligands which activate, agonize, or antagonize NgR and
for compounds with potential utility in treating disorders including CNS disorders
(e.g., stroke) and degenerative disorders such as those associated with demyelination.

For example, NgR receptor activation may mediate the prevention of neurite
outgrowth. Inhibition would be beneficial in both chronic and acute brain injury. See,
e.g., Donovan et al. (1997) J. Neurosci. 17, 5316-5326; Turgeon et al. (1998) J.
Neurosci. 18, 6882-6891; Smith-Swintosky et al. (1997) J. Neurochem. 69,
1890-1896; Gill et al. (1998) Brain Res. 797, 321-327; Suidan et al. (1996) Semin.
Thromb. Hemost. 22, 125-133.

Pharmacogenomics

Agents, or modulators that have a stimulatory or inhibitory effect on NgR
activity (e.g., NgR gene expression), as identified by a screening assay described herein
can be administered to individuals to treat (prophylactically or therapeutically)
disorders (e.g., a disease condition such as a demyelination disorder) associated with
aberrant NgR activity. In conjunction with such treatment, the pharmacogenomics
(i.e., the study of the relationship between an individual's genotype and that individual's
response to a foreign compound or drug) of the individual may be considered.
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic
failure by altering the relation between dose and blood concentration of the
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments
based on a consideration of the individual's genotype. Such pharmacogenomics can
further be used to determine appropriate dosages and therapeutic regimens.
Accordingly, the activity of NgR protein, expression of NgR nucleic acid or mutation
content of NgR genes in an individual can be determined to thereby select appropriate
agent(s) for therapeutic or prophylactic treatment of the individual.

Pharmacogenomics deals with clinically significant hereditary variations in the
response to drugs due to altered drug disposition and abnormal action in affected
persons. See, e.g., Eichelbaum (1996) Clin. Exp. Pharmacol. Physiol. 23, 983-985 and
Linder (1997) Clin. Chem. 43, 254-266. In general, two types of pharmacogenetic
conditions can be differentiated: genetic conditions transmitted as a single factor
altering the way drugs act on the body (altered drug action) and genetic conditions
transmitted as single factors altering the way the body acts on drugs (altered drug
metabolism). These pharmacogenetic conditions can occur either as rare defects or as
polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency
is a common inherited enzymopathy in which the main clinical complication is
haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics,
nitrofurans) and consumption of fava beans.

As an illustrative embodiment, the activity of drug metabolizing enzymes is a
major determinant of both the intensity and duration of drug action. The discovery of
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2
(NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an
explanation as to why some patients do not obtain the expected drug effects or show
exaggerated drug response and serious toxicity after taking the standard and safe dose
of a drug. These polymorphisms are expressed in two phenotypes in the population,
the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is
different among different populations. For example, the gene coding for CYP2D6 is
highly polymorphic and several mutations have been identified in PM, which all lead to
the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19
quite frequently experience exaggerated drug response and side effects when they
receive standard doses. If a metabolite is the active therapeutic moiety, PM show no
therapeutic response, as demonstrated for the analgesic effect of codeine mediated by
its CYP2D6-formed metabolite morphine. At the other extreme are the so-called
ultra-rapid metabolizers who do not respond to standard doses. Recently, the molecular
basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene
amplification.

Thus, the activity of NgR protein, expression of NgR nucleic acid, or mutation
content of NgR genes in an individual can be determined to thereby select appropriate
agent(s) for therapeutic or prophylactic treatment of the individual. In addition,
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles
encoding drug-metabolizing enzymes to the identification of an individual's drug
responsiveness phenotype. This knowledge, when applied to dosing or drug selection,
can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or
prophylactic efficiency when treating a subject with a NgR modulator, such as a
modulator identified by one of the exemplary screening assays described herein.

Monitoring Clinical Efficacy 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression
or activity of NgR (e.g., the ability to modulate aberrant cell proliferation and/or
differentiation) can be applied not only in basic drug screening, but also in clinical
trials. For example, the effectiveness of an agent determined by a screening assay as
described herein to increase NgR gene expression, protein levels or upregulate NgR
activity, can be monitored in clinical trials of subjects exhibiting decreased NgR gene
expression, protein levels, or downregulated NgR activity. Alternatively, the
effectiveness of an agent determined by a screening assay to decrease NgR gene
expression, protein levels, or downregulate NgR activity, can be monitored in clinical
trials of subjects exhibiting increased NgR gene expression, protein levels, or
upregulated NgR activity. In such clinical trials, the expression or activity of NgR and,
preferably, other genes that have been implicated in, for example, a disease or disorder,
can be used as a "read out" or markers of the immune responsiveness of a particular
cell.

For example, genes, including NgR, that are modulated in cells by treatment
with an agent (e.g., compound, drug or small molecule) that modulates NgR activity
(e.g., identified in a screening assay as described herein) can be identified. Thus, to
study the effect of agents on demyelination disorders, for example, in a clinical trial,
cells can be isolated and RNA prepared and analyzed for the levels of expression of
NgR and other genes implicated in the disorder. The levels of gene expression (i.e., a
gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as
described herein, or alternatively by measuring the amount of protein produced by one
of the methods as described herein or by measuring the levels of activity of NgR or
other genes. In this way, the gene expression pattern can serve as a marker, indicative
of the physiological response of the cells to the agent. Accordingly, this response state
may be determined before, and at various points during, treatment of the individual
with the agent.

In one embodiment, the invention provides a method for monitoring the
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist,
protein, peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate
identified by the screening assays described herein) comprising the steps of (i)
obtaining a pre-administration sample from a subject prior to administration of the
agent; (ii) detecting the level of expression of a NgR protein, mRNA, or genomic
DNA in the pre-administration sample; (iii) obtaining one or more post-administration
samples from the subject; (iv) detecting the level of expression or activity of the NgR
protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing
the level of expression or activity of the NgR protein, mRNA or genomic DNA in the
pre-administration sample with the NgR protein, mRNA or genomic DNA in the
post-administration sample or samples; and (vi) altering the administration of the agent to
the subject accordingly. For example, increased administration of the agent may be
desirable to increase the expression or activity of NgR to higher levels than detected,
i.e., to increase the effectiveness of the agent. Alternatively, decreased administration
of the agent may be desirable to decrease expression or activity of NgR to lower levels
than detected, i.e., to decrease the effectiveness of the agent.



Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods
of treating a subject at risk of (or susceptible to) a disorder or having a disorder
associated with aberrant NgR expression or activity.

Diseases and disorders that are characterized by increased (relative to a subject
not suffering from the disease or disorder) levels or biological activity may be treated
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that
antagonize activity may be administered in a therapeutic or prophylactic manner.
Therapeutics that may be utilized include, but are not limited to, (i) a NgR polypeptide,
or analogs, derivatives, fragments or homologs thereof; (ii) antibodies to a NgR
peptide; (iii) nucleic acids encoding a NgR peptide; (iv) administration of antisense
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous
insertion within the coding sequences of coding sequences to a NgR peptide) are
utilized to "knockout" endogenous function of a NgR peptide by homologous
recombination (see, e.g., Capecchi (1989) Science 244, 1288-1292); or (v) modulators
(i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the
invention or antibodies specific to a peptide of the invention) that alter the interaction
between a NgR peptide and its binding partner.

Diseases and disorders that are characterized by decreased (relative to a subject
not suffering from the disease or disorder) levels or biological activity may be treated
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that
upregulate activity may be administered in a therapeutic or prophylactic manner.
Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or
analogs, derivatives, fragments or homologs thereof; or an agonist that increases
bioavailability.

Increased or decreased levels can be readily detected by quantifying peptide
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art
include, but are not limited to, immunoassays (e.g., by Western blot analysis,
immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.).

In one aspect, the invention provides a method for preventing, in a subject, a
disease or condition associated with an aberrant NgR expression or activity, by
administering to the subject an agent that modulates NgR expression or at least one
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant
NgR expression or activity can be identified by, for example, any or a combination of
diagnostic or prognostic assays as described herein. Administration of a prophylactic
agent can occur prior to the manifestation of symptoms characteristic of the NgR
aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or
NgR antagonist agent can be used for treating the subject. The appropriate agent can
be determined based on screening assays described herein.

Another aspect of the invention pertains to methods of modulating NgR
expression or activity for therapeutic purposes. The modulatory method of the
invention involves contacting a cell with an agent that modulates one or more of the
activities of NgR protein associated with the cell. An agent that modulates
NgR protein activity can be an agent as described herein, such as a nucleic acid or a
protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one
or more NgR protein activities. Examples of such stimulatory agents include active
NgR protein and a nucleic acid molecule encoding NgR that has been introduced into
the cell. In another embodiment, the agent inhibits one or more NgR protein activities.
Examples of such inhibitory agents include antisense NgR nucleic acid molecules and
anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by
culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the
agent to a subject). As such, the present invention provides methods of treating an
individual afflicted with a disease or disorder characterized by aberrant expression or
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method
involves administering an agent (e.g., an agent identified by a screening assay described
herein), or combination of agents that modulates (e.g., upregulates or downregulates)
NgR expression or activity. In another embodiment, the method involves
administering a NgR protein or nucleic acid molecule as therapy to compensate for
reduced or aberrant NgR expression or activity.

Gene Therapy

Mutations in the NgR gene that result in loss of normal function of the NgR
gene product underlie NgR human disease states. The invention comprehends gene
therapy to restore NgR activity to treat those disease states. Delivery of a functional
NgR gene to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors,
and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a
retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or
chemical treatments). See, for example, Anderson (1998) Nature, supplement to
392(6679):25-30. For additional reviews of gene therapy technology see Friedmann
(1989) Science 244, 1275-1281; Verma (1990) Sci. Am., 68-84; and Miller (1992)
Nature 357, 455-460. Alternatively, it is contemplated that in other human disease
states, preventing the expression of, or inhibiting the activity of, NgR will be useful in
treating disease states. It is contemplated that antisense therapy or gene therapy could
be applied to negatively regulate the expression of NgR.

The present invention is not to be limited in scope by the specific embodiments
described herein. Indeed, various modifications of the invention in addition to those
described herein will become apparent to those skilled in the art from the foregoing
description and accompanying figure. Such modifications are intended to fall within
the scope of the appended claims.

The following Table 5 contains the sequences of exemplary polynucleotides
and polypeptides of the invention.

TABLE 5

The following DNA sequence NgR2 <SEQ ID NO. 1> was identified in humans: 



ATGCTGCCCGGGCTCAGGCGCCTGCTGCAAGCTCCCGCCTCGGCCTGCCTCCTGCTGATG 

CTCCTGGCCCTGCCCCTGGCGGCCCCCAGCTGCCCCATGCTCTGCACCTGCTACTCATCC 

CCGCCCACCGTGAGCTGCCAGGCCAACAACTTCTCCTCTGTGCCGCTGTCCCTGCCACCC 

AGCACTCAGCGACTCTTCCTGCAGAACAACCTCATCCGCACGCTGCGGCCAGGCACCirr 

GGGTCCAACCTGCTCACCCTGTGGCTCTTCTCCAACAACCTCTCCACCATCTACCCGGGC 

ACTTTCCGCCACTTGCAAGCCCTGGAGGAGCTGGACCTCGGTGACAACCGGCACCTGCGC 

TCGCTGGAGCCCGACACCTTCCAGGGCCTGGAGCGGCTGCAGTCGCTGCATTTGTACCGC 

TGCCAGCTCAGCAGCCTGCCCGGCAACATCTTCCGAGGCCTGGTCAGCCTGCAGTACCTC 

TACCTCCAGGAGAACAGCCTGCTCCACCTACAGGATGACTTGITCGCGGACCTGGCCAAC 

CTGAGCCACCTCTTCCTCCACGGQAACCGCCTGCGGCTGCTCACAGAGCACGTGTTTCGC 

GGCCTGGGCAGCCTGGACCGGCTGCTGCTGCACGGGAACCGGCTGCAGGGCGTGCACCGC 

GCGGCCTTCCGCGGCCTCAGCCGCCTCACCATCCTCTACCTGTTCAACAACAGCCTG^ 

TCGCTGCCCGGCGAGGCGCTCGCCGACCTGC(XrrCGCTCGAGTTCCTGCGGCTCAACGCT 

AACCCCTGGGCGTGCGACTGCCGCGCGCGGCCGCTCTGGGCCTGGTTCCAGCGCGCGCGC 

GTGTCCAGCTCCGACGTGACCTGCGCCACCCCCCCGGAGCGCCAGGGCCGAGACCTGCGC 

GCGCTCCGCGAGGCCGACTTCCAGGCGTGTCCGCCCGCGGCACCCACGCGGCCGGGCAGC 

CGCGCCCGCGGCAACAGCTCCTCCAACCACCTGTACGGGGTGGCCGAGGCCGGGGCGCCC 

CCAGCCGATCCCTCCACCCTCTACCGAGATCTGCCTGCCGAAGACTCGCGGGGGCGCCAG 

GGCGGGGACGCGCCTACTGAGGACGACTACTGGGGGGGCTACGGGGGTGAGGACCAGCGA 

GGGGAGCAGATGTGCCCCGGCGCTGCCTGCCAGGCGCCCCCGGACTCCCGAGGCCCTGCG 

CTCTCGGCCGGGCTCCCCAGCCCTCTGCTTTGCCTCCTGCTCCTGGTGCCCCACCACCTC 



The following amino acid sequence <SEQ ID NO. 2> is the predicted amino acid
sequence derived from the DNA sequence of SEQ ID NO. 1:

MLPGLRRLLQAPASACLLLMLLALPLAAPSC 

PMLCTCYSSPPTVSCQANNFSSVPLSLPPST 

QRLFLQNNLIRTLRPGTFGSNLLTLWLFSNN 

LSTIYPGTFRHLQALEELDLGDNRHLRSLEP 

DTFQGLERLQSLHLYRCQLSSLPGNIFRGLV 

SLQYLYLQENSLLHLQDDLFADLANLSHLFL

HGNRLRLLTEHVFRGLGSLDRLLLHGNRLQG 

VHRAAFRGLSRLTILYLFNNSLASLPGEALA 

DLPSLEFLRLNANPWACDCRARPLWAWFQRA 

RVSSSDVTCATPPERQGRDLRALREADFQAC 

PPAAPTRPGSRARGNSSSNHLYGVAEAGAPP 

ADPSTLYRDLPAEDSRGRQGGDAPTEDDYWG 

GYGGEDQRGEQMCPGAACQAPPDSRGPALSA 

GLPSPLLCLLLLVPHHL

The following DNA sequence NgR3 <SEQ ID NO. 3> was identified in mouse:

ATGTCTTGGCAGTCTGGAACCACAGTGACACAATCTCCCGTGCAGGCTGCTCAGGTCTCA 

GGGTGCTGTGTGGAATTGCTGCTGTTGCTGCTCGCTGGAGAGCTACCTCTGGGTGGTGGT 

TGTCCTCGAGACTGTGTGTGCTACCCTGCGCCCATGACTGTCAGCTGCCAGGCACACAAC 

TTTGCTGCCATCCCGGAGGGCATCCCAGAGGACAGTGAGCGCATCTTCCTGCAGAACAAT 

CGCATCACCTTCCTCCAGCAGGGCCACTTCAGCCCCGCCATGGTCACCCTCTGGATCTA^ 

TCC^CAACATCACTTTCATTGCTCCCAACACCTTCGAGGGCTTTGTG^ 

CTAGACCTrGGAGACAACCGACAGCTGCGAACGCTGGCACCCGAGACCTTCCAAGTC^ 

GTGAAGCTTCACGCCCTCTACCTCTATAAGTGTGGACTGAGCGCCCTGCCCGCAGGCATC 

TTrGGTGGCCTGCACAGCCTGCAGTATCTCTACTTGCAGGACAACCATATCGAGTACCTC 

CAAGATGACATCTTTGTGGACCTGGTCAATCTCAGTCACTTGTTTCTCCATGGTAACAAG 

CTATGGAGCCTGGGCCAAGGCATCTTCCGGGGCCTGGTGAACCTGGACCGGTTGCTGCTG 

CATGAGAACCAGCTACAGTGGGTrCACCACAAGGCTTTCCATGACCTCCACAGGCTAACC 

ACCCTCTTTCTCTTCAACAACAGCCTCACTGAGCTGCAGGGTGACTGTCTGGCCCCCCTC 

GTGGCCTTGGAGTTCCTTCGCCTCAATGGGAATGCTTGGGACTGTGGCTGCCGGGCACGT 

TCCCTGTGGGAATGGCTGCGAAGGTTCCGTGGCTCTAGCTCTGCTGTCCCCTGCGCGACC 

CCCGAGCTGCGGGAAGGCCAGGATCTGAAGCTGCTGAGGGTGGAGGACTTCCGGAACT(^ 

ACAGGACCAGTGTCrcCTCACCAGATCAAGTCTCACACGCTTACCACCTCTGACAGGGCT 

GCCCGCAAGGAGCACCATCCGTC(XATGGGGCCTCCAGGGACAAAGGCCACCCACA^ 

CATCCGCCKKSCTCCAGGTCAGGTTACAAGAAGGCAGGCAAGAAC^ 

AACCGGAACCAGATCTCTAAGGTGAGCTCTGGGAAAGAGCTTACCGAACTGCAGGACTAT 

GCCCCCGACTATCAGCACAAGTTCAGCTTTGACATCATGCCCACCGCACGACCCAAGAGG 

AAGGGCAAGTGTGCTCGCAGGACCCCCATCCGTGCCCCCAGTGGGGTGCAGCAGGCATCC 

TCAGGCACGGCCCTTGGGGCCCCACTCCTGGCCTGGATACTGGGGCTGGCAGTCACTCTC 

CGC 



The following protein sequence <SEQ ID NO. 4> is the deduced protein of SEQ ID
NO:3:

MSWQSGTTVTQSPVQAAQVSGCCVELLLLLL
AGELPLGGGCPRDCVCYPAPMTVSCQAHNFA
AIPEGIPEDSERIFLQNNRITFLQQGHFSPA
MVTLWIYSNNITFIAPNTFEGFVHLEELDLG
DNRQLRTLAPETFQGLVKLHALYLYKCGLSA
LPAGIFGGLHSLQYLYLQDNHIEYLQDDIFV
DLVNLSHLFLHGNKLWSLGQGIFRGLVNLDR
LLLHENQLQWVHHKAFHDLHRLTTLFLFNNS
LTELQGDCLAPLVALEFLRLNGNAWDCGCRA
RSLWEWLRRFRGSSSAVPCATPELRQGQDLK
LLRVEDFRNCTGPVSPHQIKSHTLTTSDRAA
RKEHHPSHGASRDKGHPHGHPPGSRSGYKKA
GKNCTSHRNRNQISKVSSGKELTELQDYAPD
YQHKFSFDIMPTARPKRKGKCARRTPIRAPS
GVQQASSGTALGAPLLAWILGLAVTLR

The following protein sequence <SEQ ID NO. 5> is NgR1 from humans:

MKRASAGGSRLLAWVLWLQAWQVAAPCPGAC
VCYNEPKVTTSCPQQGLQAVPVGIPAASQRI
FLHGNRISHVPAASFRACRNLTILWLHSNVL
ARIDAAAFTGLALLEQLDLSDNAQLRSVDPA
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAA
LQYLYLQDNALQALPDDTFRDLGNLTHLFLH
GNRISSVPERAFRGLHSLDRLLLHQNRVAHV
HPHAFRDLGRLMTLYLFANNLSALPTEALAP
LRALQYLRLNDNPWVCDCRARPLWAWLQKFR
GSSSEVPCSLPQRLAGRDLKRLAANDLQGCA
VATGPYHPIWTGRATDEEPLGLPKCCQPDAA
DKASVLEPGRPASAGNALKGRVPPGDSPPGN
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGS
EPPGFPTSGPRRRPGCSRKNRTRSHCRLGQA
GSGGGGTGDSEGSGALPSLTCSLTPLGLALV
LWTVLGPC

The following amino acid sequence <SEQ ID NO:6> is a Consensus Sequence of
NgR based on homology with NgR1:

CPXXCXCYXXPXXTXSCXXXXXXXXPX 
XXPXXXXRXFLXXNXIXXXXXXXFXXXXXXXXLWX 
XSNXXXXIXXXXFXXXXXLEXLDLXDNXXLR 
XXXPXTFXGLXXLXLXLXXCXLXXLXXXXFX 
GLXXLQYLYLQXNXXXXLXDDXFXDLXNLXH 
LFLHGNXXXXXXXXXFRGLXXLDRLLLHXNX 
XXXVHXXAFXXLXRLXXLXLFXNXLXXLXXX 
XLAXLXXLXXLRLNXNXWXCXCRARXLWXWX 
XXXRXSSSXVXCXXPXXXXGXDLXXLXXXDX 
XXCXXXXXPXXPXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXGXXXXXXXXXXXX 
PPXXXSXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXRXXX 
XXXXXXXXXXXXXXXXXXXXXXXXLXXXXX 
XX X XX L 



The following protein sequence <SEQ ID NO:7> is the 66 amino acid active domain
of Nogo:

RIYKGVIQAIQKSDEGHPFRAYLESEVAISE
ELVQKYSNSALGHVNCTIKELRRLFLVDDLV
DSLK

The following protein sequence <SEQ ID NO:8> is the amino acid sequence of the
mature NgR2:

CPMLCTCYSSPPTVSCQANNFSSVPLSLPPS
TQRLFLQNNLIRTLRPGTFGSNLLTLWLFSN 
NLSTIYPGTFRHLQALEELDLGDNRHLRSLE 
PDTFQGLERLQSLHLYRCQLSSLPGNIFRGL 
VSLQYLYLQENSLLHLQDDLFADLANLSHLF 
LHGNRLRLLTEHVFRGLGSLDRLLLHGNRLQ 
GVHRAAFRGLSRLTILYLFNNSLASLPGEAL 
ADLPSLEFLRLNANPWACDCRARPLWAWFQR 
ARVSSSDVTCATPPERQGRDLRALREADFQA
CPPAAPTRPGSRARGNSSSNHLYGVAEAGAP 
PADPSTLYRDLPAEDSRGRQGGDAPTEDDYW 
GGYGGEDQRGEQMCPGAACQAPPDSRGPALS 
AGLPSPLLCLLLLVPHHL 



The following protein sequence <SEQ ID NO:9> is the amino acid sequence of the 
mature NgR3: 

CPRDCVCYPAPMTVSCQAHNFAAIPEGIPED 
SERIFLQNNRITFLQQGHFSPAMVTLWIYSN 
NITFIAPNTFEGFVHLEELDLGDNRQLRTLA 
PETFQGLVKLHALYLYKCGLSALPAGIFGGL 
HSLQYLYLQDNHIEYLQDDIFVDLVNLSHLF 
LHGNKLWSLGQGIFRGLVNLDRLLLHENQLQ 
WVHHKAFHDLHRLTTLFLFNNSLTELQGDCL 
APLVALEFLRLNGNAWDCGCRARSLWEWLRR 
FRGSSSAVPCATPELRQGQDLKLLRVEDFRN 
CTGPVSPHQIKSHTLTTSDRAARKEHHPSHG 
ASRDKGHPHGHPPGSRSGYKKAGKNCTSHRN 
RNQISKVSSGKELTELQDYAPDYQHKFSFDI 
MPTARPKRKGKCARRTPIRAPSGVQQASSGT 
ALGAPLLAWILGLAVTLR



The following amino acid sequence <SEQ ID NO:10> is a conserved cysteine motif
(Cysteine domain 1) of the NgR and homologs based on the Consensus Sequence:

CPXXCXCYXXPXXTXSC

The following amino acid sequence <SEQ ID NO:11> is a conserved cysteine motif
(Cysteine domain 2) of the NgR and homologs based on the Consensus Sequence:

NXWXCXCRARXLWXWXXXXRXSSSXVXCXXP
XXXXGXDLXXLXXXDXXXC



The following amino acid sequence <SEQ ID NO:12> is a conserved leucine-rich
domain of the NgR and homologs based on the Consensus Sequence:

RXFLXXNXIXXXXXXXFXXXXXXXXLWXXSN
XXXXIXXXXFXXXXXLEXLDLXDNXXLRXXX
PXTFXGLXXLXLXLXXCXLXXLXXXXFXGLX
XLQYLYLQXNXXXXLXDDXFXDLXNLXHLFL
HGNXXXXXXXXXFRGLXXLDRLLLHXNXXXX
VHXXAFXXLXRLXXLXLFXNXLXXLXXXXLA
XLXXLXXLRL



Unless otherwise indicated, X is any amino acid. For example, X where
indicated may be no amino acid. Additional features of the invention will be apparent
from the following Examples. Examples 1-5 are actual, while the remaining Examples
are prophetic.
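
As a hedged illustration of how the consensus notation above can be read, the following sketch converts the Cysteine domain 1 motif (SEQ ID NO:10) into a regular expression and checks it against the first line of the mature NgR2 and NgR3 sequences (SEQ ID NOs: 8 and 9) as listed in Table 5. The function name `motif_to_regex` is hypothetical, and treating every X as optional is an assumption made for illustration only: the specification states merely that X "where indicated" may be no amino acid, without marking which positions those are.

```python
import re

def motif_to_regex(motif: str) -> re.Pattern:
    """Turn a consensus motif into a regex.

    Assumption: every 'X' matches one residue or none at all.
    All other letters must match exactly.
    """
    parts = []
    for ch in motif:
        if ch == "X":
            parts.append("[A-Z]?")   # any residue, or absent
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts))

CYS_DOMAIN_1 = "CPXXCXCYXXPXXTXSC"                      # SEQ ID NO:10
NGR2_MATURE_START = "CPMLCTCYSSPPTVSCQANNFSSVPLSLPPS"   # first line of SEQ ID NO:8
NGR3_MATURE_START = "CPRDCVCYPAPMTVSCQAHNFAAIPEGIPED"   # first line of SEQ ID NO:9

pattern = motif_to_regex(CYS_DOMAIN_1)

for name, seq in [("NgR2", NGR2_MATURE_START), ("NgR3", NGR3_MATURE_START)]:
    hit = pattern.match(seq)          # anchored at the mature N-terminus
    print(name, hit.group(0) if hit else None)
```

Both mature sequences carry a 16-residue stretch against a 17-position motif, which is consistent with one X position being empty in NgR2 and NgR3.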

As shown by the following Examples, genes encoding novel NgRs have been
identified by computational analysis of DNA sequence data. The proteins encoded by
NgR2 and NgR3 have a putative signal sequence, eight leucine-rich repeat domains in
a conserved leucine-rich region (SEQ ID NO:12), a conserved cysteine-rich region
(SEQ ID NO:10) N-terminal to the leucine-rich region, a second cysteine-rich domain
(SEQ ID NO:11) C-terminal to the leucine-rich region, and a putative
glycophosphatidylinositol-linkage (GPI-linkage) site. NgR2 and NgR3 differ from the
previously identified NgR sequence. The NgR homologs, when compared to known
NgRs, show a consensus sequence (SEQ ID NO:6). The putative mature NgR2 and
NgR3 are shown in Table 5 as SEQ ID NOs: 8 and 9, respectively.



Example 1: Tblastn query of the HTG database 

The protein sequence for the human NgR (NgR1) (SEQ ID NO:5) was used to
query the high throughput genomic (HTG) database, the use of which is familiar to
those skilled in the art. The HTG database is a part of GenBank, a comprehensive
NIH genetic sequence database, which includes an annotated collection of all publicly
available DNA sequences (Nucleic Acids Res. (2000) 28, 15-18). The HTG database
includes sequences obtained from genomic DNA. Within genomic DNA, genes are
typically encoded by multiple segments of DNA called exons. Thus when one aligns a
cDNA sequence (or a protein sequence encoded by a cDNA sequence) to a genomic
sequence, the sequence will be broken up into segments depending on the number of
exons in the gene.

The BLAST algorithm, which stands for Basic Local Alignment Search Tool, is
suitable for determining sequence similarity (Altschul et al., (1990) J. Mol. Biol. 215,
403-410, which is incorporated herein by reference in its entirety). Software for
performing BLAST analyses is publicly available through the National Center for
Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The basic BLAST
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying
short words of length W in the query sequence that either match or satisfy some
positive-valued threshold score T when aligned with a word of the same length in a
database sequence. T is referred to as the neighborhood word score threshold
(Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating
searches to find HSPs containing them. The word hits are extended in both directions
along each sequence for as far as the cumulative alignment score can be increased.
Extension of the word hits in each direction is halted when: 1) the cumulative
alignment score falls off by the quantity X from its maximum achieved value; 2) the
cumulative score goes to zero or below, due to the accumulation of one or more
negative-scoring residue alignments; or 3) the end of either sequence is reached. The
BLAST algorithm parameters W, T and X determine the sensitivity and speed of the
alignment. The BLAST program uses as defaults a word length (W) of 11, the
BLOSUM62 scoring matrix (see Henikoff et al., (1992) Proc. Natl. Acad. Sci. USA
89, 10915-10919, which is incorporated herein by reference in its entirety), alignments
(B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands.
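
The seed-and-extend procedure just described can be sketched as follows. This is a minimal illustration, not the NCBI implementation: it seeds on exact word matches of length W rather than on neighborhood words scored against the threshold T, it performs ungapped extension only, and the function names (`find_word_hits`, `extend_hit`) and the match/mismatch scores of +5 and -4 are assumptions chosen for the example.

```python
def find_word_hits(query, subject, w):
    """Return (q_pos, s_pos) pairs where a length-w word matches exactly."""
    index = {}
    for s_pos in range(len(subject) - w + 1):
        index.setdefault(subject[s_pos:s_pos + w], []).append(s_pos)
    hits = []
    for q_pos in range(len(query) - w + 1):
        for s_pos in index.get(query[q_pos:q_pos + w], []):
            hits.append((q_pos, s_pos))
    return hits


def _extend(query, subject, q, s, step, base, x_drop, match, mismatch):
    """Extend in one direction; return (best score gain, residues kept)."""
    gain = best_gain = 0
    length = best_len = 0
    # Halting condition 3: stop at the end of either sequence.
    while 0 <= q < len(query) and 0 <= s < len(subject):
        gain += match if query[q] == subject[s] else mismatch
        length += 1
        if gain > best_gain:
            best_gain, best_len = gain, length
        # Halting condition 1: score fell by X from its maximum.
        # Halting condition 2: cumulative score is zero or below.
        if best_gain - gain >= x_drop or base + gain <= 0:
            break
        q += step
        s += step
    return best_gain, best_len


def extend_hit(query, subject, q_pos, s_pos, w, x_drop=20, match=5, mismatch=-4):
    """Extend a word hit both ways; return (score, q_start, q_end), end exclusive."""
    seed = w * match
    right_gain, right_len = _extend(query, subject, q_pos + w, s_pos + w, +1,
                                    seed, x_drop, match, mismatch)
    left_gain, left_len = _extend(query, subject, q_pos - 1, s_pos - 1, -1,
                                  seed + right_gain, x_drop, match, mismatch)
    return seed + right_gain + left_gain, q_pos - left_len, q_pos + w + right_len


query = "AAACGTACGTCCC"      # toy sequences, not taken from the specification
subject = "TTTCGTACGTGGG"
hits = find_word_hits(query, subject, 4)
score, q_start, q_end = extend_hit(query, subject, 3, 3, 4, x_drop=8)
print(hits, score, query[q_start:q_end])
```

In this toy case the seed at position 3 grows rightward over three further matches and then stops once two mismatches pull the running score 8 below its maximum, so the reported segment is the shared run CGTACGT.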

The BLAST algorithm (Karlin et al., (1993) Proc. Natl. Acad. Sci. USA 90,
5873-5877, which is incorporated herein by reference) and Gapped BLAST perform a
statistical analysis of the similarity between two sequences. One measure of similarity
provided by the BLAST algorithm is the smallest sum probability (P(N)), which
provides an indication of the probability by which a match between two nucleotide or
amino acid sequences would occur by chance. For example, a nucleic acid is
considered similar to a NgR gene or cDNA if the smallest sum probability in
comparison of the test nucleic acid to a NgR nucleic acid is less than about 1,
preferably less than about 0.1, more preferably less than about 0.01, and most
preferably less than about 0.001.
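
The nested thresholds on the smallest sum probability can be expressed as a simple lookup. The sketch below is illustrative only: the function name `similarity_tier` and the tier labels are hypothetical, and the hard cut-offs stand in for limits that the text qualifies as "about".

```python
def similarity_tier(p_n: float) -> str:
    """Map a smallest sum probability P(N) to the tiers named in the text."""
    if p_n < 0.001:
        return "most preferred"
    if p_n < 0.01:
        return "more preferred"
    if p_n < 0.1:
        return "preferred"
    if p_n < 1:
        return "similar"
    return "not similar"


for p in (4e-43, 0.005, 0.05, 0.5, 1.0):
    print(p, similarity_tier(p))
```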

To query the HTG database with the NgR protein sequence, we used a
variation of the BLAST algorithm known as the tblastn program, which compares a
protein query sequence against a nucleotide sequence database dynamically translated
in all reading frames (J. Mol. Biol. (1990) 215, 403-410; Nucleic Acids Res. (1997)
25, 3389-3402). The results of the tblastn search indicated the presence of genes in
the database with a significant identity to the NgR. In addition to finding hits to
genomic clones which contain the human and mouse NgR genes, we found hits to
clones where the identity was not as high, but still very significant. Three human
clones were found (Accession numbers: AC068514, AC016869, AC013606) with an
e-value of 4e-43 and one mouse clone was found (Accession No. AC021768) with an
e-value of 1e-78. The three human clones all appeared to encode the same gene, so
further analysis was confined to AC013606.

Example 2: Prediction of the human NgR2 protein sequence (AC013606)

The human NgR protein sequence aligned with two regions of translated
sequence from nucleotide sequence AC013606, indicating that the new gene was
encoded by at least two exons. In order to define the complete gene, we used the
computer program GENSCAN™ (J. Mol. Biol. (1997) 268, 78-94), which can identify
complete exon/intron structures of genes in genomic DNA. The gene prediction by
GENSCAN™ contained seven exons. By comparing these predicted exons to the
NgR, it was concluded that the new human gene contains two of these exons and a
part of another (containing the initiating methionine). The predicted cDNA (mRNA)
encoded by these three exons was assembled from AC013606 (HTG11; deposited
March 2000; length 143899; GenBank release 118.0; SEQ ID NO:15) by combining
nucleotides from the three exons whose coordinates are: 123292-123322 (exon 1);
130035-130516 (exon 2); and 138589-139335 (exon 3). The sequence for this cDNA
sequence is SEQ ID NO:1 (nucleotide sequence of human NgR2; AC013606). The
translation of this cDNA provides the protein sequence of human NgR2 (SEQ ID
NO:2).

We used the protein sequence of human NgR2 as a query sequence against the
human EST database. A number of hits of high significance were found indicating that
the NgR2 mRNA is expressed in a number of tissues including fetal brain.
Furthermore, two of these ESTs provided support for the exon structure that we
deduced. One EST (Accession No: GB_EST19:AD46757) contains 565 nucleotides
corresponding to amino acids 84-271 of the human NgR2 (SEQ ID NO:2). This spans
the second intron located between amino acids 171 and 172, and provides positive
evidence for the splicing of exons 2 and 3 at the mRNA level. Another EST
(GB_EST26:AI929019) contains 545 nucleotides, part of which corresponds to amino
acids 1-75 of the human NgR2 (SEQ ID NO:2). This spans the first intron located
between amino acids 10 and 11, and provides positive evidence for the splicing of
exons 1 and 2 at the mRNA level.
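
The exon coordinates given in this Example can be checked arithmetically against the intron positions reported for the two ESTs. The sketch below assumes that the coordinates are 1-based and inclusive, as in GenBank feature locations; the function names are hypothetical, and the genomic sequence itself is not reproduced here.

```python
# Exon coordinates on AC013606 as listed in the text (assumed 1-based, inclusive).
HUMAN_NGR2_EXONS = [(123292, 123322), (130035, 130516), (138589, 139335)]


def exon_lengths(exons):
    """Length in nucleotides of each exon."""
    return [end - start + 1 for start, end in exons]


def assemble_cdna(genomic, exons):
    """Concatenate forward-strand exons from a genomic sequence string."""
    return "".join(genomic[start - 1:end] for start, end in exons)


def junction_codons(exons):
    """Number of complete codons lying upstream of each exon-exon junction."""
    positions, total = [], 0
    for length in exon_lengths(exons)[:-1]:
        total += length
        positions.append(total // 3)
    return positions


if __name__ == "__main__":
    lengths = exon_lengths(HUMAN_NGR2_EXONS)
    print("exon lengths:", lengths)                # [31, 482, 747]
    print("cDNA length:", sum(lengths))            # 1260, a multiple of 3
    print("complete codons before each junction:",
          junction_codons(HUMAN_NGR2_EXONS))       # [10, 171]
```

Under that assumption the junctions fall after 10 and after 171 complete codons, which agrees with introns located between amino acids 10 and 11 and between amino acids 171 and 172.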

Example 3: Prediction of the mouse NgR3 protein sequence (AC021768)

The human NgR protein sequence aligned with only one region of translated
sequence from nucleotide sequence AC021768, indicating that most of the new mouse
gene was encoded by one large exon. However, upon inspection, the protein encoded
by this exon was missing an initiating methionine. In order to define the complete
gene, we used the computer program GENSCAN as described above. The gene
prediction by GENSCAN contained two exons: the large one found by visual
inspection and a short one at the 5' end which provided an initiating methionine. The
predicted cDNA (mRNA) encoded by these two exons was assembled from AC021768
(HTG11; deposited March 2000; length = 215980; GenBank release 118.0; SEQ ID
NO:16) by combining nucleotides from the two exons whose coordinates are: the
complement of 164265-164325 (exon 1); and the complement of 155671-156992
(exon 2). The sequence of this cDNA is SEQ ID NO:3 (nucleotide
sequence of mouse NgR3; AC021768). The translation of this cDNA provides the
protein sequence of mouse NgR3 (SEQ ID NO:4).

We used the protein sequence of mouse NgR3 as a query sequence against the
mouse EST database. One hit of high significance was found indicating that the NgR3
mRNA is expressed in the heart. This EST (GB_EST20:AI428334) contains 463
nucleotides, part of which corresponds to amino acids 45-193 of mouse NgR3 (SEQ ID
NO:4).
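
Because both mouse exons lie on the complementary strand, each exon must be reverse-complemented before the exons are joined in transcript order. The sketch below illustrates this under the assumption that the coordinates are 1-based and inclusive; the function names are hypothetical and the toy sequence is for demonstration only.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

# Exons on AC021768 in transcript order, as listed in the text
# (complement strand; coordinates assumed 1-based, inclusive).
MOUSE_NGR3_EXONS = [(164265, 164325), (155671, 156992)]


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def assemble_minus_strand_cdna(genomic, exons):
    """Join complement-strand exons, given in transcript order, into a cDNA."""
    return "".join(reverse_complement(genomic[start - 1:end]) for start, end in exons)


if __name__ == "__main__":
    lengths = [end - start + 1 for start, end in MOUSE_NGR3_EXONS]
    print("exon lengths:", lengths)       # [61, 1322]
    print("cDNA length:", sum(lengths))   # 1383, a multiple of 3

    # Toy genomic sequence: exon 1 = complement(10..12), exon 2 = complement(1..3).
    toy = "ATGCCCGGGTAA"
    print(assemble_minus_strand_cdna(toy, [(10, 12), (1, 3)]))  # TTACAT
```

Exon 1 has the higher genomic coordinates because transcription of a complement-strand gene runs from high to low positions on the deposited sequence.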

Example 4: Similarity between the NgRs

An alignment between NgR1 and the two new receptors is shown in Fig.
1A-1B. The similarities between these proteins include:

(1) The SignalP program, which locates the signal sequence cleavage
position, predicts a cleavage before the first conserved cysteine in all the proteins.
Thus the mature protein in all cases will have a cysteine at the N-terminus.

(2) All proteins contain eight Leucine Rich Repeats (LRR). LRRs are short
sequence motifs present in a number of proteins with diverse functions and cellular
locations. These repeats are usually involved in protein-protein interactions. Each
LRR is composed of a beta-alpha unit.

(3) All three proteins contain a leucine rich repeat N-terminal domain
(LRRNT), in which four cysteines are conserved. LRRs are often flanked by cysteine
rich domains at both their N and C termini.

(4) All three proteins contain a LRR C-terminal domain (LRRCT). The
LRRCTs of the three NgR proteins can be distinguished from those of other LRR
containing proteins by the pattern of tryptophans and cysteines which are completely
conserved in this domain.

(5) All three proteins contain a conserved cysteine in the fourth LRR
domain.

(6) All three proteins contain a conserved potential glycosylation site in the
eighth LRR domain.

(7) NgR2 and NgR3 have a hydrophobic C-terminus, as does NgR1, an
indication that they probably also undergo a modification similar to NgR1, where a
GPI moiety is covalently linked to a C-terminal amino acid. This allows the protein to
remain tethered to the cell.

Example 5: Preparation of Nogo Proteins

A Nogo binding assay was developed which utilizes a method widely used in
examining semaphorin and ephrin axonal guidance function (Flanagan &
Vanderhaeghen (1998) Annu. Rev. Neurosci. 21, 309-345; Takahashi et al., (1999)
Cell 99, 59-69). It involves fusing a secreted placental alkaline phosphatase (AP)
moiety to the ligand in question to provide a biologically active receptor binding agent
which can be detected with an extremely sensitive colorimetric assay. For Nogo, an
expression vector is created encoding a signal peptide, a His6 tag for purification, AP,
and the 66 amino acid active domain of Nogo. The fusion protein can be purified from
the conditioned medium of transfected cells in milligram amounts. This protein is
biologically active as a growth cone collapsing agent with an EC50 of 1 nM.

Alternatively, a glutathione-S-transferase Nogo (GST-Nogo) fusion protein
may be prepared. For GST-Nogo, an expression vector (e.g., a pGEX vector) is
created encoding a signal peptide, GST, and the 66 amino acid active domain of Nogo.
GST-Nogo may be purified from the culture medium and used as a GST fusion
protein, or GST may be cleaved from the Nogo portion of the fusion protein with an
enzyme that recognizes the specific amino acid cleavage site engineered between the
GST portion and the Nogo portion of the fusion protein. Such sites are part of the
commercially available GST vectors. The specific cleavage sites and enzymes may be
used in accordance with the manufacturer's specifications.

It has been found that AP-Nogo is actually slightly more potent than
GST-Nogo, perhaps because the protein is synthesized in a eukaryotic rather than a
prokaryotic cell.

Binding of Nogo to immobilized NgR homologs may be performed in an
ELISA-type assay in which AP-Nogo is allowed to react with an immobilized receptor
homolog. Specificity of binding may be demonstrated in a competitive binding assay
using increasing amounts of GST-Nogo in this type of assay to show a decreasing
amount of binding of AP-Nogo (as judged in the colorimetric assay).



Example 6: Transfected COS Cell Binding Assays

The homologs of the present invention may be used in transfection studies in
COS cells to demonstrate binding of Nogo. Specifically, nucleotide sequences
encoding NgR2 and NgR3 may be transfected into COS cells using a suitable vector.
Non-transfected COS-7 cells do not bind AP-Nogo. However, transfection of COS
cells with nucleic acid sequences encoding NgRs will make them capable of binding
Nogo. AP alone does not bind with any stable affinity to these transfected cells,
indicating that any affinity of Nogo for NgR2 or NgR3 would be due to the 66 amino
acids derived from Nogo. Furthermore, specific affinity of Nogo for the NgR2 or
NgR3 proteins may be tested in displacement of AP-Nogo assays using GST-Nogo.
NgR2 and/or NgR3 may also bind homologs of Nogo, which may also be tested using
this assay.

Example 7: Expression of NgR in Human Cell Lines using Northern Blot and a
Random-Primed Probe

A Northern blot is purchased from a commercial source, or RNA samples from
cells of interest are run on an agarose gel and blotted to a membrane using any of the
well known techniques for Northern blotting. The blot is probed with a fragment of
NgR2 (SEQ ID NO:1) or NgR3 (SEQ ID NO:3). The probe is prepared from 50 ng
of cDNA labeled by a random-primed method (Feinberg and Vogelstein (1983) Anal.
Biochem. 132, 6-13). Hybridization is carried out at 68°C for 1 hour in
ExpressHyb™ solution (Clontech, Cat. No. 8015-1) followed by washing with 2X
SSC/0.05% SDS at room temperature and two washes with 0.1X SSC/0.1% SDS at
50°C. Expression of NgR2 and/or NgR3 can be assessed by the presence of an
appropriately sized band on the blot.

Example 8: Cloning of cDNA corresponding to NgRs

To obtain the full-length clone corresponding to NgR2 from a cDNA library,
the following method may be used. A cDNA library is generated using standard
methods from a tissue known to contain NgR2. Such a tissue was identified in
Example 2. 1 x 10^ plaque forming units from the cDNA library may be screened in
duplicate on OPTITRAN™ filters. The filters are hybridized with 32P-labeled
oligonucleotides that are generated from the ESTs corresponding to portions of NgR2.
The hybridization reaction may consist of 400 ml plaque screen buffer (50 mM Tris pH
7.5, 1 M NaCl, 0.1% sodium pyrophosphate, 0.2% polyvinylpyrrolidone and 0.2%
Ficoll) containing 10% dextran sulfate and 100 µg/ml tRNA and 80 pmol each
32P-labeled oligonucleotide at 65°C overnight. The filters are washed twice with 2X
SSC/1% SDS and twice with 1X SSC/1% SDS and exposed to film. Duplicate positives
are purified. DNA from each of these clones is analyzed by restriction enzyme digest
followed by agarose gel electrophoresis and Southern blotting. The filters are
hybridized to the 32P-labeled oligonucleotides used for the original hybridization to
confirm that inserts hybridize to the probe. The insert is then sequenced to confirm that
it represents the cDNA for NgR2. Similar methods may be used to generate a
full-length clone corresponding to NgR3.

Alternatively, a full-length clone of NgR2 or NgR3 can be obtained by a
person of ordinary skill in the art employing conventional PCR techniques.

Example 9: Hybridization Analysis to demonstrate NgR expression in the brain

The expression of NgR in mammals, such as the rat, may be investigated by in
situ hybridization histochemistry. To investigate expression in the brain, for example,
coronal and sagittal rat brain cryosections (20 µm thick) are prepared using a
Reichert-Jung cryostat. Individual sections are thaw-mounted onto silanized, nuclease-free
slides (CEL Associates, Inc., Houston, TX), and stored at -80°C. Sections are
processed starting with post-fixation in cold 4% paraformaldehyde, rinsed in cold
phosphate-buffered saline (PBS), acetylated using acetic anhydride in triethanolamine
buffer, and dehydrated through a series of alcohol washes in 70%, 95%, and 100%
alcohol at room temperature. Subsequently, sections are delipidated in chloroform,
followed by rehydration through successive exposure to 100% and 95% alcohol at
room temperature. Microscope slides containing processed cryosections are allowed
to air dry prior to hybridization. Other tissues may be assayed in a similar fashion.

A NgR-specific probe may be generated using PCR. Following PCR
amplification, the fragment is digested with restriction enzymes and cloned into
pBluescript II cleaved with the same enzymes. For production of a probe specific for
the sense strand of NgR, a NgR fragment cloned in pBluescript II may be
linearized with a suitable restriction enzyme, which provides a substrate for labeled
run-off transcripts (i.e., cRNA riboprobes) using the vector-borne T7 promoter and
commercially available T7 RNA polymerase. A probe specific for the antisense strand
of NgR may also be readily prepared using the NgR clone in pBluescript II by cleaving
the recombinant plasmid with a suitable restriction enzyme to generate a linearized
substrate for the production of labeled run-off cRNA transcripts using the T3 promoter
and cognate polymerase. The riboprobes may be labeled with [35S]-UTP to yield a
specific activity of about 0.40 x 10^ cpm/pmol for antisense riboprobes and about 0.65
x 10^ cpm/pmol for sense-strand riboprobes. Each riboprobe may be subsequently
denatured and added (2 pmol/ml) to hybridization buffer which contains 50%
formamide, 10% dextran, 0.3 M NaCl, 10 mM Tris (pH 8.0), 1 mM EDTA, 1X
Denhardt's Solution, and 10 mM dithiothreitol. Microscope slides containing
sequential brain cryosections may be independently exposed to 45 µl of hybridization
solution per slide and silanized cover slips may be placed over the sections being
exposed to hybridization solution. Sections are incubated overnight (15-18 hours) at
52°C to allow hybridization to occur. Equivalent series of cryosections are then
exposed to sense or antisense NgR-specific cRNA riboprobes.

Following the hybridization period, coverslips are washed off the slides in 1X
SSC, followed by RNase A treatment involving the exposure of slides to 20 µg/ml
RNase A in a buffer containing 10 mM Tris-HCl (pH 7.4), 0.5 M EDTA, and 0.5 M
NaCl for 45 minutes at 37°C. The cryosections are then subjected to three
high-stringency washes in 0.1X SSC at 52°C for 20 minutes each. Following the
series of washes, cryosections are dehydrated by consecutive exposure to 70%, 95%,
and 100% ammonium acetate in alcohol, followed by air drying and exposure to
Kodak BioMax™ MR-1 film. After 13 days of exposure, the film is developed, and
any significant hybridization signal is detected. Based on these results, slides containing
tissue that hybridized, as shown by film autoradiograms, are coated with Kodak NTB-2
nuclear track emulsion and the slides are stored in the dark for 32 days. The slides
are then developed and counterstained with hematoxylin. Emulsion-coated sections
are analyzed microscopically to determine the specificity of labeling. The signal is
determined to be specific if autoradiographic grains (generated by antisense probe
hybridization) are clearly associated with cresyl violet-stained cell bodies.
Autoradiographic grains found between cell bodies indicate non-specific binding of the
probe.

In some cases, such as using a probe to detect a NgR homolog in a
heterologous species, in order to achieve optimal hybridization, it may be necessary to
decrease the stringency conditions. Such conditions are well known to those of
ordinary skill in the art and examples are provided above.

Expression of NgR in the brain provides an indication that modulators of NgR
activity have utility for treating neurological disorders. Some other diseases for which
modulators of NgR may have utility include depression, anxiety, bipolar disease,
epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and the like. Use of NgR
modulators, including NgR ligands and anti-NgR antibodies, to treat individuals having
such disease states is intended as an aspect of the invention.

Example 10: Northern Blot Analysis of NgR-RNA with a PCR-generated Probe

Northern blot hybridizations may be performed to examine the expression of
NgR mRNA. A clone containing at least a portion of the sequence of SEQ ID NO:1
may be used as a probe. Vector-specific primers are used in PCR to generate a
hybridization probe fragment for 32P-labeling. The PCR is performed as follows:

Mix:    NgR-containing plasmid
        2 µl     fwd primer (10-50 pM)
        2 µl     rev primer (10-50 pM)
        10 µl    10x PCR buffer (such as that provided with the enzyme,
                 Amersham Pharmacia Biotech)
        1 µl     10 mM dNTP (such as #1 969 064 from Boehringer Mannheim)
        0.5 µl   Taq polymerase (such as #27-0799-62, Amersham Pharmacia
                 Biotech)
        83.5 µl  water

PCR is performed in a Thermocycler using the following program:



        94°C    5 min

        94°C    1 min
        55°C    1 min     30 cycles
        72°C    1 min

        72°C    10 min
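
The reaction mix and cycling program of this Example can be written down as data and checked arithmetically. This is only a sketch: the volume of the plasmid template is not stated in the text, so it is left out of the sum, and the hold-time total ignores the ramping time between temperatures. All names are hypothetical.

```python
from dataclasses import dataclass

# Component volumes in microlitres as listed in the text.
# The plasmid template volume is not given and is therefore omitted.
REACTION_UL = {
    "fwd primer": 2.0,
    "rev primer": 2.0,
    "10x PCR buffer": 10.0,
    "10 mM dNTP": 1.0,
    "Taq polymerase": 0.5,
    "water": 83.5,
}


@dataclass(frozen=True)
class Step:
    temp_c: int
    minutes: float


INITIAL_DENATURATION = Step(94, 5)
CYCLE = [Step(94, 1), Step(55, 1), Step(72, 1)]
N_CYCLES = 30
FINAL_EXTENSION = Step(72, 10)


def listed_volume_ul(mix):
    """Sum of the listed component volumes, template excluded."""
    return sum(mix.values())


def hold_time_minutes():
    """Total programmed hold time, ignoring ramping between steps."""
    per_cycle = sum(step.minutes for step in CYCLE)
    return INITIAL_DENATURATION.minutes + N_CYCLES * per_cycle + FINAL_EXTENSION.minutes


if __name__ == "__main__":
    print("listed volume (µl):", listed_volume_ul(REACTION_UL))  # 99.0
    print("hold time (min):", hold_time_minutes())               # 105
```

The listed components total 99 µl. Since 10 µl of 10x buffer gives a 1x concentration only in a final volume of about 100 µl, roughly 1 µl would remain for the template; this is an inference and is not stated in the text.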



The PCR product may be purified using the QIAquick PCR Purification Kit
(#28104) from Qiagen, and radioactive labeling with 32P-dCTP (#AA0005/250,
Amersham Pharmacia Biotech) may be done by random priming using "Ready-to-go
DNA Labeling Beads" (#27-9240-01) from Amersham Pharmacia Biotech.
Hybridization is carried out on a Human Multiple Tissue Northern Blot from Clontech as
described in the manufacturer's protocol, or on a Northern blot prepared by running RNA
samples from cells of interest on an agarose gel and blotting to a membrane using any
of the known Northern blotting protocols. After exposure overnight on a Molecular
Dynamics Phosphor Imager screen (#MD146-814), bands of an appropriate size are
visualized.

Example 11: Recombinant Expression of NgR in Eukaryotic Host Cells

A. Expression of NgR in Mammalian Cells

To produce NgR protein, a NgR-encoding polynucleotide is expressed in a
suitable host cell using a suitable expression vector and standard genetic engineering
techniques. For example, a NgR-encoding sequence described in Table 4 is subcloned
into the commercial expression vector pZeoSV2 (Invitrogen, San Diego, CA) and
transfected into Chinese Hamster Ovary (CHO) cells using the transfection reagent
FuGENE6™ (Boehringer-Mannheim) and the transfection protocol provided in the
product insert. Other eukaryotic cell lines, including human embryonic kidney (HEK
293) and COS cells, are suitable as well. Cells stably expressing NgR are selected by
growth in the presence of 100 µg/ml zeocin (Stratagene, La Jolla, CA). As an
alternative to FuGENE6™, the expression vector may carry the gene for dihydrofolate
reductase (dhfr) and selection of clones with methotrexate (MTX) drug pressure
allows for stable transformation of CHO cells. Optionally, NgR may be purified from
the cells using standard chromatographic techniques. To facilitate purification, antisera
is raised against one or more synthetic peptide sequences that correspond to portions
of the NgR amino acid sequence, and the antisera is used to affinity purify Nogo-R.
The NgR also may be expressed in-frame with a tag sequence (e.g., polyhistidine,
hemagglutinin, FLAG) to facilitate purification. Moreover, it will be appreciated that
many of the uses for NgR polypeptides, such as assays described below, do not require
purification of NgR from the host cell.

B. Expression of NgR in CHO cells

For expression of NgR in Chinese hamster ovary (CHO) cells, a plasmid
bearing the relevant NgR coding sequence is prepared, using a vector which also bears
the selectable marker dihydrofolate reductase (DHFR). The plasmid is transfected into
CHO cells. Selection under MTX drug pressure allows for preparation of stable
transformants of a NgR (NgR2 or NgR3) in an expression plasmid carrying a
selectable marker such as DHFR.

C. Expression of NgR in 293 cells

For expression of NgR in mammalian cells 293 (transformed human, primary
embryonic kidney cells), a plasmid bearing the relevant NgR coding sequence is
prepared, using vector pSecTag2A (Invitrogen). Vector pSecTag2A contains the
murine Ig κ-chain leader sequence for secretion, the c-myc epitope for detection of the
recombinant protein with the anti-myc antibody, a C-terminal polyhistidine for
purification with nickel chelate chromatography, and a Zeocin resistance gene for
selection of stable transfectants. The forward primer for amplification of this NgR
cDNA is determined by routine procedures and preferably contains a 5' extension of
nucleotides to introduce the HindIII cloning site and nucleotides matching the NgR
sequence. The reverse primer is also determined by routine procedures and preferably
contains a 5' extension of nucleotides to introduce an XhoI restriction site for cloning
and nucleotides corresponding to the reverse complement of the NgR sequence. The
PCR is performed with an annealing temperature of 55°C. The PCR product is gel
purified and cloned into the HindIII-XhoI sites of the vector.

The DNA is purified using Qiagen chromatography columns and transfected
into 293 cells using DOTAP™ transfection media (Boehringer Mannheim,
Indianapolis, IN). Transiently transfected cells are tested for expression 24 hours
after transfection, using Western blots probed with anti-His and anti-NgR peptide
antibodies. Permanently transfected cells are selected with Zeocin and propagated.
Production of the recombinant protein is detected from both cells and media by
Western blots probed with anti-His, anti-Myc or anti-NgR peptide antibodies.

D. Transient Expression of Nogo-R in COS cells

For expression of the NgR in COS7 cells, a polynucleotide molecule having a
nucleotide sequence of SEQ ID NO:1, for example, can be cloned into vector p3-CI.
This vector is a pUC18-derived plasmid that contains the HCMV (human
cytomegalovirus) promoter-intron located upstream from the bGH (bovine growth
hormone) polyadenylation sequence and a multiple cloning site.

The forward primer is determined by routine procedures and preferably
contains a 5' extension which introduces an XbaI restriction site for cloning, followed
by nucleotides which correspond to a nucleotide sequence of SEQ ID NO:1. The
reverse primer is also determined by routine procedures and preferably contains a 5'
extension of nucleotides which introduces a SalI cloning site followed by nucleotides
which correspond to the reverse complement of a nucleotide sequence of SEQ ID
NO:1.
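
The primer layout described above (a 5' extension carrying a restriction site, followed by template-matching nucleotides) can be sketched as follows. The recognition sequences for XbaI (TCTAGA) and SalI (GTCGAC) are standard; the two-nucleotide clamp, the 20-nucleotide annealing length and the function names are illustrative assumptions, not values taken from this Example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

XBAI_SITE = "TCTAGA"  # XbaI recognition sequence
SALI_SITE = "GTCGAC"  # SalI recognition sequence


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def design_primers(cdna, anneal_len=20, clamp="GC"):
    """Build forward and reverse primers with 5' restriction-site extensions.

    anneal_len and clamp are hypothetical defaults chosen for illustration.
    """
    cdna = cdna.upper()
    if len(cdna) < anneal_len:
        raise ValueError("template is shorter than the annealing length")
    forward = clamp + XBAI_SITE + cdna[:anneal_len]
    reverse = clamp + SALI_SITE + reverse_complement(cdna[-anneal_len:])
    return forward, reverse


if __name__ == "__main__":
    fwd, rev = design_primers("ATGAAACCCGGGTTTTAA", anneal_len=6)
    print(fwd)  # GCTCTAGAATGAAA
    print(rev)  # GCGTCGACTTAAAA
```

In practice the coding sequence would also be checked for internal XbaI or SalI sites before these enzymes are chosen, since an internal site would cut the insert during cloning.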

The PCR consists of an initial denaturation step of 5 min at 95°C, 30 cycles of
30 sec denaturation at 95°C, 30 sec annealing at 58°C and 30 sec extension at 72°C,
followed by 5 min extension at 72°C. The PCR product is gel purified and ligated into
the XbaI and SalI sites of vector p3-CI. This construct is transformed into E. coli cells
for amplification and DNA purification. The DNA is purified with Qiagen
chromatography columns and transfected into COS 7 cells using Lipofectamine™
reagent from BRL, following the manufacturer's protocols. Forty-eight and 72 hours
after transfection, the media and the cells are tested for recombinant protein
expression.

NgR expressed from a COS cell culture can be purified by concentrating the
cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for
example, chromatography. Purified NgR is concentrated to 0.5 mg/ml in an Amicon
concentrator fitted with a YM-10 membrane and stored at -80°C. NgR3 may also be
expressed using this method and the nucleotide sequence of SEQ ID NO:3 or SEQ ID
NO:13.

E. Expression of NgR in Insect Cells

For expression of NgR in a baculovirus system, a polynucleotide molecule
having a nucleotide sequence of SEQ ID NO:1, 3 or 13 can be amplified by PCR. The
forward primer is determined by routine procedures and preferably contains a 5'
extension which adds the NdeI cloning site, followed by nucleotides which correspond
to a nucleotide sequence of SEQ ID NO:1 (or SEQ ID NO:3 or SEQ ID NO:13,
respectively). The reverse primer is also determined by routine procedures and
preferably contains a 5' extension which introduces the KpnI cloning site, followed by
nucleotides which correspond to the reverse complement of a nucleotide sequence of
SEQ ID NO:1 (or SEQ ID NO:3 or SEQ ID NO:13, respectively).

The PCR product is gel purified, digested with NdeI and KpnI, and cloned into
the corresponding sites of vector pAcHLT-A (Pharmingen, San Diego, CA). The
pAcHLT expression vector contains the strong polyhedrin promoter of the
Autographa californica nuclear polyhedrosis virus (AcMNPV), and a 6xHis tag
upstream from the multiple cloning site. A protein kinase site for phosphorylation and
a thrombin site for excision of the recombinant protein, both preceding the multiple
cloning site, are also present. Of course, many other baculovirus vectors could be used
in place of pAcHLT-A, such as pAc373, pVL941 and pAcIM1. Other suitable vectors for the
expression of NgR polypeptides can be used, provided that the vector construct
includes appropriately located signals for transcription, translation, and trafficking,
such as an in-frame AUG and a signal peptide, as required. Such vectors are described
in Luckow et al., Virology 170:31-39, among others.

The virus is grown and isolated using standard baculovirus expression methods,
such as those described in Summers et al. (1987) A MANUAL OF METHODS FOR
BACULOVIRUS VECTORS AND INSECT CELL CULTURE PROCEDURES, Texas Agricultural
Experimental Station Bulletin No. 1555.

In a preferred embodiment, pAcHLT-A containing the NgR gene is introduced into
baculovirus using the "BaculoGold™" transfection kit (Pharmingen, San Diego, CA)
using methods established by the manufacturer. Individual virus isolates are analyzed
for protein production by radiolabeling infected cells with 35S-methionine at 24 hours
post infection. Infected cells are harvested at 48 hours post infection, and the labeled
proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels can
be isolated and used for scaled up expression.

For expression of a NgR polypeptide in Sf9 cells, a polynucleotide molecule
having the nucleotide sequence of SEQ ID NO:1 (or SEQ ID NO:3 or SEQ ID
NO:13) can be amplified by PCR using the primers and methods described above for
baculovirus expression. The NgR cDNA is cloned into vector pAcHLT-A
(Pharmingen) for expression in Sf9 insect cells. The insert is cloned into the NdeI and KpnI
sites, after elimination of an internal NdeI site (using the same primers described above
for expression in baculovirus). DNA is purified with Qiagen chromatography columns
and expressed in Sf9 cells. In preliminary Western blot experiments, non-purified
plaques are tested for the presence of a recombinant protein of the expected size
which reacts with the NgR-specific antibody. These results are confirmed after
further purification and expression optimization in HiG5 cells.

F. Expression of soluble forms of NgR2 and NgR3 as NgR-Ig fusion
proteins.

To generate a NgR2-Ig fusion protein, standard methods may be used as
described in the literature (e.g., Sanicola et al. (1997) Proc. Natl. Acad. Sci. USA 94,
6238-6243). For example, a DNA fragment encoding NgR2 without the sequence
encoding the hydrophobic C-terminus (GPI anchor signal) may be ligated to a DNA
fragment encoding the Fc domain of IgG1 (which may be human IgG1), and the
chimeric fragment may be cloned into an expression vector to generate a plasmid. The
plasmid may then be transfected into Chinese hamster ovary cells to generate a stable
cell line producing the fusion protein. The fusion protein is then purified from
conditioned media using standard methods. For example, clarified conditioned media
from the cell line may be loaded by gravity directly onto Protein A Sepharose. The
column may then be washed with five column volumes each of PBS, PBS containing
0.5 M NaCl, and 25 mM sodium phosphate, 100 mM NaCl (pH 5.0). The bound
protein may then be eluted with 25 mM NaH2PO4, 100 mM NaCl (pH 2.8) and
immediately neutralized with 1/10 fraction volume of 0.5 M Na2HPO4 (pH 8.6).
Similar methods may be used to generate a NgR3-Ig fusion protein.

Example 12: Interaction Trap/Two-Hybrid System

In order to assay for NgR-interacting proteins, the interaction trap/two-hybrid
library screening method can be used. This assay was first described in Fields et al.
(1989) Nature 340, 245, which is incorporated herein by reference in its entirety. A
protocol is published in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 1999, John
Wiley & Sons, NY and Ausubel, F. M. et al. 1992, SHORT PROTOCOLS IN MOLECULAR
BIOLOGY, fourth edition, Greene and Wiley-interscience, NY, which is incorporated
herein by reference in its entirety. Kits are available from Clontech, Palo Alto, CA
(Matchmaker Two-Hybrid System 3).

A fusion of the nucleotide sequences encoding all or partial NgR and the yeast
transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an
appropriate plasmid (i.e., pGBKT7) using standard subcloning techniques. Similarly, a
GAL4 active domain (AD) fusion library is constructed in a second plasmid (i.e.,
pGADT7) from cDNA of potential NgR-binding proteins (for protocols on forming
cDNA libraries, see Sambrook et al. 1989, MOLECULAR CLONING: A LABORATORY
MANUAL, second edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
NY, which is incorporated herein by reference in its entirety). The DNA-BD/NgR
fusion construct is verified by sequencing, and tested for autonomous reporter gene
activation and cell toxicity, both of which would prevent a successful two-hybrid
analysis. Similar controls are performed with the AD/library fusion construct to insure
expression in host cells and lack of transcriptional activity. Yeast cells are transformed
(ca. 10^5 transformants/mg DNA) with both the NgR and library fusion plasmids
according to standard procedure (Ausubel et al., 1992, SHORT PROTOCOLS IN
MOLECULAR BIOLOGY, fourth edition, Greene and Wiley-interscience, NY, which is
incorporated herein by reference in its entirety). In vivo binding of DNA-BD/NgR
with AD/library proteins results in transcription of specific yeast plasmid reporter
genes (i.e., lacZ, HIS3, ADE2, LEU2). Yeast cells are plated on nutrient-deficient
media to screen for expression of reporter genes. Colonies are dually assayed for
β-galactosidase activity upon growth in Xgal
(5-bromo-4-chloro-3-indolyl-β-D-galactoside) supplemented media (filter assay for
β-galactosidase activity is described in Breeden et al., (1985) Cold Spring Harb. Symp.
Quant. Biol. 50, 643, which is incorporated herein by reference in its entirety).
Positive AD-library plasmids are rescued from transformants and reintroduced into the
original yeast strain as well as other strains containing unrelated DNA-BD fusion
proteins to confirm specific NgR/library protein interactions. Insert DNA is sequenced
to verify the presence of an open reading frame fused to GAL4 AD and to determine
the identity of the NgR-binding protein.

Example 13: Antibodies to Nogo-R

Standard techniques are employed to generate polyclonal or monoclonal
antibodies to the NgR receptor, and to generate useful antigen-binding fragments
thereof or variants thereof, including "humanized" variants. Such protocols can be
found, for example, in Sambrook et al. (1989), above, and Harlow et al. (Eds.),
ANTIBODIES: A LABORATORY MANUAL, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, NY (1988). In one embodiment, recombinant NgR polypeptides (or
cells or cell membranes containing such polypeptides) are used as antigen to generate
the antibodies. In another embodiment, one or more peptides having amino acid
sequences corresponding to an immunogenic portion of NgR (e.g., 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20, or more amino acids) are used as antigen. Peptides
corresponding to extracellular portions of Nogo-R, especially hydrophilic extracellular
portions, are preferred. The antigen may be mixed with an adjuvant or linked to a
hapten to increase antibody production.

A. Polyclonal or Monoclonal antibodies

As one exemplary protocol, recombinant NgR or a synthetic fragment thereof
is used to immunize a mouse for generation of monoclonal antibodies (or larger
mammal, such as a rabbit, for polyclonal antibodies). To increase antigenicity,
peptides are conjugated to Keyhole Limpet Hemocyanin (Pierce), according to the
manufacturer's recommendations. For an initial injection, the antigen is emulsified with
Freund's Complete Adjuvant and injected subcutaneously. At intervals of two to three
weeks, additional aliquots of NgR antigen are emulsified with Freund's Incomplete
Adjuvant and injected subcutaneously. Prior to the final booster injection, a serum
sample is taken from the immunized mice and assayed by western blot to confirm the
presence of antibodies that immunoreact with NgR. Serum from the immunized
animals may be used as polyclonal antisera or used to isolate polyclonal antibodies that
recognize NgR. Alternatively, the mice are sacrificed and their spleens removed for
generation of monoclonal antibodies.

To generate monoclonal antibodies, the spleens are placed in 10 ml serum-free
RPMI 1640, and single cell suspensions are formed by grinding the spleens in
serum-free RPMI 1640, supplemented with 2 mM L-glutamine, 1 mM sodium pyruvate, 100
units/ml penicillin, and 100 µg/ml streptomycin (RPMI) (Gibco, Canada). The cell
suspensions are filtered and washed by centrifugation and resuspended in serum-free
RPMI. Thymocytes taken from three naive Balb/c mice are prepared in a similar
manner and used as a feeder layer. NS-1 myeloma cells, kept in log phase in RPMI
with 10% fetal bovine serum (FBS) (Hyclone Laboratories, Inc., Logan, UT) for three
days prior to fusion, are centrifuged and washed as well.

To produce hybridoma fusions, spleen cells from the immunized mice are
combined with NS-1 cells and centrifuged, and the supernatant is aspirated. The cell
pellet is dislodged by tapping the tube, and 2 ml of 37°C PEG 1500 (50% in 75 mM
HEPES, pH 8.0) (Boehringer-Mannheim) is stirred into the pellet, followed by the
addition of serum-free RPMI. Thereafter, the cells are centrifuged, resuspended in
RPMI containing 15% FBS, 100 µM sodium hypoxanthine, 0.4 µM aminopterin, 16
µM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer-Mannheim) and 1.5 x 10^
thymocytes/ml, and plated into 10 Corning flat-bottom 96-well tissue culture plates
(Corning, Corning, NY).

On days 2, 4, and 6 after the fusion, 100 µl of medium is removed from the
wells of the fusion plates and replaced with fresh medium. On day 8, the fusions are
screened by ELISA, testing for the presence of mouse IgG that binds to NgR.
Selected fusion wells are further cloned by dilution until monoclonal cultures
producing anti-NgR antibodies are obtained.




B. Humanization of anti-NgR monoclonal antibodies 

The expression pattern of NgR as reported herein and the potential of NgRs as
targets for therapeutic intervention suggest therapeutic indications for NgR inhibitors
(antagonists). NgR-neutralizing antibodies comprise one class of therapeutics useful as
NgR antagonists. Following are protocols to improve the utility of anti-NgR
monoclonal antibodies as therapeutics in humans by "humanizing" the monoclonal
antibodies to improve their serum half-life and render them less immunogenic in human
hosts (i.e., to prevent human antibody response to non-human anti-NgR antibodies).
The principles of humanization have been described in the literature and are
facilitated by the modular arrangement of antibody proteins. To minimize the
possibility of binding complement, a humanized antibody of the IgG4 isotype is
preferred.

For example, a level of humanization is achieved by generating chimeric
antibodies comprising the variable domains of non-human antibody proteins of interest
with the constant domains of human antibody molecules. (See, e.g., Morrison et al.
(1989) Adv. Immunol. 44, 65-92). The variable domains of NgR-neutralizing
anti-NgR antibodies are cloned from the genomic DNA of a B-cell hybridoma or from
cDNA generated from mRNA isolated from the hybridoma of interest. The V region
gene fragments are linked to exons encoding human antibody constant domains, and
the resultant construct is expressed in suitable mammalian host cells (e.g., myeloma or
CHO cells).

To achieve an even greater level of humanization, only those portions of the
variable region gene fragments that encode antigen-binding complementarity
determining regions ("CDR") of the non-human monoclonal antibody genes are cloned
into human antibody sequences. (See, e.g., Jones et al. (1986) Nature 321, 522-525;
Riechmann et al. (1988) Nature 332, 323-327; Verhoeyen et al. (1988) Science 239,
1534-1536; and Tempest et al. (1991) Bio/Technology 9, 266-271). If necessary, the
β-sheet framework of the human antibody surrounding the CDR3 regions also is
modified to more closely mirror the three dimensional structure of the antigen-binding
domain of the original monoclonal antibody. (See Kettleborough et al. (1991) Protein
Engin. 4, 773-783; and Foote et al. (1992) J. Mol. Biol. 224, 487-499).

In an alternative approach, the surface of a non-human monoclonal antibody of
interest is humanized by altering selected surface residues of the non-human antibody,
e.g., by site-directed mutagenesis, while retaining all of the interior and contacting
residues of the non-human antibody. See Padlan (1991) Mol. Immunol. 28, 489-498.

The foregoing approaches are employed using NgR-neutralizing anti-NgR
monoclonal antibodies and the hybridomas that produce them to generate humanized
NgR-neutralizing antibodies useful as therapeutics to treat or palliate conditions
wherein NgR expression or ligand-mediated NgR signaling is detrimental.

C. Human NgR-Neutralizing Antibodies from Phage Display

Human NgR-neutralizing antibodies are generated by phage display techniques
such as those described in Aujame et al. (1997) Human Antibodies 8, 155-168;
Hoogenboom (1997) TIBTECH 15, 62-70; and Rader et al. (1997) Curr. Opin.
Biotechnol. 8, 503-508, all of which are incorporated by reference. For example,
antibody variable regions in the form of Fab fragments or linked single chain Fv
fragments are fused to the amino terminus of filamentous phage minor coat protein
pIII. Expression of the fusion protein and incorporation thereof into the mature phage
coat results in phage particles that present an antibody on their surface and contain the
genetic material encoding the antibody. A phage library comprising such constructs is
expressed in bacteria, and the library is screened for NgR-specific phage-antibodies
using labeled or immobilized NgR as antigen-probe.

D. Human NgR-neutralizing antibodies from transgenic mice

Human NgR-neutralizing antibodies are generated in transgenic mice essentially
as described in Bruggemann et al. (1996) Immunol. Today 17, 391-397 and
Bruggemann et al. (1997) Curr. Opin. Biotechnol. 8, 455-458. Transgenic mice
carrying human V-gene segments in germline configuration and that express these
transgenes in their lymphoid tissue are immunized with a NgR composition using
conventional immunization protocols. Hybridomas are generated using B cells from the
immunized mice using conventional protocols and screened to identify hybridomas
secreting anti-NgR human antibodies (e.g., as described above).

Example 14: Assays to Identify Modulators of NgR Activity

Set forth below are several nonlimiting assays for identifying modulators
(agonists and antagonists) of NgR activity. Among the modulators that can be
identified by these assays are natural ligand compounds of the receptor; synthetic
analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or
antibody-like compounds derived from natural antibodies or from antibody-like
combinatorial libraries; and/or synthetic compounds identified by high-throughput
screening of libraries; and the like. All modulators that bind NgR are useful for
identifying NgR in tissue samples (e.g., for diagnostic purposes, pathological purposes,
and the like). Agonist and antagonist modulators are useful for up-regulating and
down-regulating NgR activity, respectively, to treat disease states characterized by
abnormal levels of NgR activity. The assays may be performed using single putative
modulators, and/or may be performed using a known agonist in combination with
candidate antagonists (or vice versa).

A. cAMP Assays 

In one type of assay, levels of cyclic adenosine monophosphate (cAMP) are
measured in NgR-transfected cells that have been exposed to candidate modulator
compounds. Protocols for cAMP assays have been described in the literature. (See,
e.g., Sutherland et al. (1968) Circulation 37, 279; Frandsen et al. (1976) Life
Sciences 18, 529-541; Dooley et al. (1997) J. Pharmacol. Exp. Therap. 283, 735-41;
and George et al. (1997) J. Biomol. Screening 2, 235-40). An exemplary protocol for
such an assay, using an Adenylyl Cyclase Activation FlashPlate® Assay from NEN™
Life Science Products, is set forth below.

Briefly, the NgR coding sequence (e.g., a cDNA or intronless genomic DNA)
is subcloned into a commercial expression vector, such as pzeoSV2 (Invitrogen), and
transiently transfected into Chinese Hamster Ovary (CHO) cells using known methods,
such as the transfection protocol provided by Boehringer-Mannheim when supplying
the FuGENE 6 transfection reagent. Transfected CHO cells are seeded into 96-well
microplates from the FlashPlate® assay kit, which are coated with solid scintillant to
which antisera to cAMP has been bound. For a control, some wells are seeded with
wild type (untransfected) CHO cells. Other wells in the plate receive various amounts
of a cAMP standard solution for use in creating a standard curve.

One or more test compounds (i.e., candidate modulators) are added to the cells
in each well, with water and/or compound-free medium/diluent serving as a control or
controls. After treatment, cAMP is allowed to accumulate in the cells for exactly 15
minutes at room temperature. The assay is terminated by the addition of lysis buffer
containing [125I]-labeled cAMP, and the plate is counted using a Packard Topcount™
96-well microplate scintillation counter. Unlabeled cAMP from the lysed cells (or
from standards) and fixed amounts of [125I]-cAMP compete for antibody bound to the
plate. A standard curve is constructed, and cAMP values for the unknowns are
obtained by interpolation. Changes in intracellular cAMP levels of cells in response to
exposure to a test compound are indicative of NgR modulating activity. Modulators
that act as agonists of receptors which couple to the Gs subtype of G proteins will
stimulate production of cAMP, leading to a measurable 3-10 fold increase in cAMP
levels. Agonists of receptors which couple to the Gi/o subtype of G proteins will inhibit
forskolin-stimulated cAMP production, leading to a measurable decrease in cAMP
levels of 50-100%. Modulators that act as inverse agonists will reverse these effects at
receptors that are either constitutively active or activated by known agonists.
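
By way of illustration only, the interpolation of unknowns from the standard curve can be sketched in Python. This is not the FlashPlate kit's own software. It assumes a competitive-binding curve in which counts fall as unlabeled cAMP rises, and it interpolates linearly in log10 concentration between the two bracketing standards. The function names and every numeric value are hypothetical.

```python
import math


def interpolate_camp(standards, counts):
    """Estimate cAMP (pmol/well) for one well from a competitive standard curve.

    standards: list of (pmol_per_well, counts) pairs with distinct counts.
    counts: measured counts for the unknown well.
    """
    # Order the standards by counts so that neighbours bracket the unknown.
    pts = sorted(standards, key=lambda p: p[1])
    if not pts[0][1] <= counts <= pts[-1][1]:
        raise ValueError("counts fall outside the standard curve")
    for (conc_a, y_a), (conc_b, y_b) in zip(pts, pts[1:]):
        if y_a <= counts <= y_b:
            # Linear interpolation on a log10 concentration axis.
            frac = (counts - y_a) / (y_b - y_a)
            log_c = math.log10(conc_a) + frac * (math.log10(conc_b) - math.log10(conc_a))
            return 10 ** log_c
    raise ValueError("no bracketing standards found")


def fold_change(treated_camp, control_camp):
    """Ratio of cAMP in treated wells to cAMP in control wells."""
    return treated_camp / control_camp


# Hypothetical standards: (pmol/well, counts).
standards = [(1.0, 8000.0), (10.0, 4000.0), (100.0, 1000.0)]
unknown = interpolate_camp(standards, 6000.0)
```

A fold change computed this way can then be compared with the 3-10 fold increase or the 50-100% decrease described above.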



B. Aequorin Assays

In another assay, cells (e.g., CHO cells) are transiently co-transfected with both
a NgR expression construct and a construct that encodes the photoprotein apoaequorin.
In the presence of the cofactor coelenterazine, apoaequorin will emit a measurable
luminescence that is proportional to the amount of intracellular (cytoplasmic) free
calcium. (See generally, Cobbold et al., "Aequorin measurements of cytoplasmic free
calcium," In: McCormack J.G. and Cobbold P.H., eds., Cellular Calcium: A
Practical Approach. Oxford: IRL Press (1991); Stables et al. (1997) Anal.
Biochem. 252, 115-26; and Haugland, HANDBOOK OF FLUORESCENT PROBES AND
RESEARCH CHEMICALS. Sixth edition. Molecular Probes, Eugene, OR (1996)).

In one exemplary assay, NgR is subcloned into the commercial expression
vector pzeoSV2 (Invitrogen) and transiently co-transfected along with a construct that
encodes the photoprotein apoaequorin (Molecular Probes, Eugene, OR) into CHO cells
using the transfection reagent FuGENE 6 (Boehringer-Mannheim) and the transfection
protocol provided in the product insert.

The cells are cultured for 24 hours at 37°C in MEM (Gibco/BRL,
Gaithersburg, MD) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10
U/ml penicillin and 10 µg/ml streptomycin, at which time the medium is changed to
serum-free MEM containing 5 µM coelenterazine (Molecular Probes, Eugene, OR).
Culturing is then continued for two additional hours at 37°C. Subsequently, cells are
detached from the plate using VERSENE (Gibco/BRL), washed, and resuspended at
200,000 cells/ml in serum-free MEM.

Dilutions of candidate NgR modulator compounds are prepared in serum-free
MEM and dispensed into wells of an opaque 96-well assay plate at 50 µl/well. Plates
are then loaded onto an MLX microtiter plate luminometer (Dynex Technologies, Inc.,
Chantilly, VA). The instrument is programmed to dispense 50 µl cell suspensions into
each well, one well at a time, and immediately read luminescence for 15 seconds.
Dose-response curves for the candidate modulators are constructed using the area
under the curve for each light signal peak. Data are analyzed with SlideWrite, using
the equation for a one-site ligand, and EC50 values are obtained. Changes in
luminescence caused by the compounds are considered indicative of modulatory
activity. Modulators that act as agonists at receptors which couple to the Gq subtype
of G proteins give an increase in luminescence of up to 100 fold. Modulators that act
as inverse agonists will reverse this effect at receptors that are either constitutively
active or activated by known agonists.
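
By way of illustration only, the two numerical steps named above (area under each light peak, then a one-site fit to obtain an EC50) can be sketched in Python. This is a stand-in for, not a reproduction of, the SlideWrite analysis. It assumes a one-site model of the form response = top * c / (EC50 + c) with zero baseline and fits it by a simple grid search; the function names and data are hypothetical.

```python
import math


def area_under_curve(times, signal):
    """Trapezoidal area under one luminescence trace."""
    return sum((t1 - t0) * (s0 + s1) / 2.0
               for t0, t1, s0, s1 in zip(times, times[1:], signal, signal[1:]))


def one_site(conc, top, ec50):
    """One-site model with zero baseline (an assumed form)."""
    return top * conc / (ec50 + conc)


def fit_one_site(concs, responses, n_grid=400):
    """Least-squares fit of the one-site model by a log-spaced grid search.

    For each candidate EC50 the best 'top' has a closed form, so only
    EC50 needs to be searched. Returns (ec50, top).
    """
    lo = math.log10(min(concs)) - 1.0
    hi = math.log10(max(concs)) + 1.0
    best = None
    for i in range(n_grid + 1):
        ec50 = 10 ** (lo + (hi - lo) * i / n_grid)
        f = [c / (ec50 + c) for c in concs]
        top = sum(r * x for r, x in zip(responses, f)) / sum(x * x for x in f)
        sse = sum((r - top * x) ** 2 for r, x in zip(responses, f))
        if best is None or sse < best[0]:
            best = (sse, ec50, top)
    return best[1], best[2]


# Hypothetical dose-response data: molar concentrations and peak areas.
concs = [1e-10, 1e-9, 1e-8, 1e-7, 1e-6]
areas = [one_site(c, top=100.0, ec50=1e-8) for c in concs]
ec50, top = fit_one_site(concs, areas)
```

A grid search is used here only to keep the sketch dependency-free; a nonlinear least-squares routine would normally be preferred.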

C. Luciferase Reporter Gene Assay

The photoprotein luciferase provides another useful tool for assaying for
modulators of NgR activity. Cells (e.g., CHO cells or COS 7 cells) are transiently
co-transfected with both a NgR expression construct (e.g., NgR in pzeoSV2) and a
reporter construct which includes a gene for the luciferase protein downstream from a
transcription factor binding site, such as the cAMP-response element (CRE), AP-1, or
NF-kappa B. Expression levels of luciferase reflect the activation status of the
signaling events. (See generally, George et al. (1997) J. Biomol. Screening 2,
235-240; and Stratowa et al. (1995) Curr. Opin. Biotechnol. 6, 574-581). Luciferase
activity may be quantitatively measured using, e.g., luciferase assay reagents that are
commercially available from Promega (Madison, WI).

In one exemplary assay, CHO cells are plated in 24-well culture dishes at a
density of 100,000 cells/well one day prior to transfection and cultured at 37°C in
MEM (Gibco/BRL) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10
U/ml penicillin and 10 µg/ml streptomycin. Cells are transiently co-transfected with
both a NgR expression construct and a reporter construct containing the luciferase
gene. The reporter plasmids CRE-luciferase, AP-1-luciferase and NF-kappaB-luciferase
may be purchased from Stratagene (La Jolla, CA). Transfections are
performed using the FuGENE 6 transfection reagent (Boehringer-Mannheim)
according to the supplier's instructions. Cells transfected with the reporter construct
alone are used as a control. Twenty-four hours after transfection, cells are washed
once with PBS pre-warmed to 37°C. Serum-free MEM is then added to the cells
either alone (control) or with one or more candidate modulators and the cells are
incubated at 37°C for five hours. Thereafter, cells are washed once with ice-cold PBS
and lysed by the addition of 100 µl of lysis buffer per well from the luciferase assay kit
supplied by Promega. After incubation for 15 minutes at room temperature, 15 µl of
the lysate is mixed with 50 µl of substrate solution (Promega) in an opaque-white,
96-well plate, and the luminescence is read immediately on a Wallace model 1450
MicroBeta scintillation and luminescence counter (Wallace Instruments, Gaithersburg,
MD).

Differences in luminescence in the presence versus the absence of a candidate
modulator compound are indicative of modulatory activity. Receptors that are either
constitutively active or activated by agonists typically give a 3-20-fold stimulation of
luminescence compared to cells transfected with the reporter gene alone. Modulators
that act as inverse agonists will reverse this effect.
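
By way of illustration only, the fold-stimulation comparison can be written as a short Python sketch. It assumes replicate wells are averaged before the ratio is taken, which the text does not specify; the function names and luminescence readings are hypothetical.

```python
from statistics import mean


def fold_stimulation(receptor_wells, reporter_only_wells):
    """Mean luminescence of NgR + reporter wells over reporter-only wells."""
    return mean(receptor_wells) / mean(reporter_only_wells)


def modulator_ratio(with_candidate, without_candidate):
    """Mean luminescence with a candidate modulator over the untreated mean.

    Values below 1 would be consistent with reversal of the stimulation.
    """
    return mean(with_candidate) / mean(without_candidate)


# Hypothetical raw luminescence counts from replicate wells.
fold = fold_stimulation([3000.0, 5000.0], [400.0, 600.0])
ratio = modulator_ratio([1000.0, 1000.0], [4000.0, 4000.0])
```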

D. Intracellular calcium measurement using FLIPR

Changes in intracellular calcium levels are another recognized indicator of
receptor activity, and such assays can be employed to screen for modulators of NgR
activity. For example, CHO cells stably transfected with a NgR expression vector are
plated at a density of 4 x 10^ cells/well in Packard black-walled, 96-well plates
specially designed to discriminate fluorescence signals emanating from the various
wells on the plate. The cells are incubated for 60 minutes at 37°C in modified
Dulbecco's PBS (D-PBS) containing 36 mg/L pyruvate and 1 g/L glucose with the
addition of 1% fetal bovine serum and one of four calcium indicator dyes (Fluo-3™
AM, Fluo-4™ AM, Calcium Green™-1 AM, or Oregon Green™ 488 BAPTA-1 AM),
each at a concentration of 4 µM. Plates are washed once with modified D-PBS
without 1% fetal bovine serum and incubated for 10 minutes at 37°C to remove
residual dye from the cellular membrane. In addition, a series of washes with modified
D-PBS without 1% fetal bovine serum is performed immediately prior to activation of
the calcium response.

A calcium response is initiated by the addition of one or more candidate
receptor agonist compounds, calcium ionophore A23187 (10 µM; positive control), or
ATP (4 µM; positive control). Fluorescence is measured by Molecular Devices'
FLIPR with an argon laser (excitation at 488 nm). (See, e.g., Kuntzweiler et al.
(1998) Drug Dev. Res. 44, 14-20). The F-stop for the detector camera is set at 2.5 and
the length of exposure is 0.4 milliseconds. Basal fluorescence of cells is measured for
20 seconds prior to addition of candidate agonist, ATP, or A23187, and the basal
fluorescence level is subtracted from the response signal. The calcium signal is
measured for approximately 200 seconds, taking readings every two seconds. Calcium
ionophore A23187 and ATP increase the calcium signal 200% above baseline levels.
In general, activated NgRs increase the calcium signal at least about 10-15% above
baseline signal.
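
By way of illustration only, the baseline subtraction can be sketched in Python. The sketch assumes one reading every two seconds, so that the 20-second basal period corresponds to the first ten readings, and it expresses the peak baseline-subtracted signal as a percentage of the basal level. The function name and trace values are hypothetical.

```python
def percent_above_baseline(trace, n_basal=10):
    """Peak calcium signal as a percentage above basal fluorescence.

    trace: fluorescence readings taken every 2 s; the first n_basal
    readings (20 s by default) are taken before the addition.
    """
    basal = sum(trace[:n_basal]) / n_basal
    # Subtract the basal level from every post-addition reading.
    corrected = [f - basal for f in trace[n_basal:]]
    return 100.0 * max(corrected) / basal


# Hypothetical trace: 10 basal readings followed by a response.
trace = [1000.0] * 10 + [1000.0, 1100.0, 1150.0, 1120.0, 1050.0]
response = percent_above_baseline(trace)
```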

E. [35S]GTPγS Binding Assay

It is also possible to evaluate whether NgR signals through a G
protein-mediated pathway. Because G protein-coupled receptors signal through
intracellular G proteins whose activity involves GTP binding and hydrolysis to yield
bound GDP, measurement of binding of the non-hydrolyzable GTP analog
[35S]-GTPγS in the presence and absence of candidate modulators provides another assay
for modulator activity. (See, e.g., Kowal et al. (1998) Neuropharmacology 37,
179-187.)




In one exemplary assay, cells stably transfected with a NgR expression vector
are grown in 10 cm tissue culture dishes to subconfluence, rinsed once with 5 ml of
ice-cold Ca2+/Mg2+-free phosphate-buffered saline, and scraped into 5 ml of the same
buffer. Cells are pelleted by centrifugation (500 x g, 5 minutes), resuspended in TEE
buffer (25 mM Tris, pH 7.5, 5 mM EDTA, 5 mM EGTA), and frozen in liquid
nitrogen. After thawing, the cells are homogenized using a Dounce homogenizer (1 ml
TEE per plate of cells), and centrifuged at 1,000 x g for 5 minutes to remove nuclei
and unbroken cells.

The homogenate supernatant is centrifuged at 20,000 x g for 20 minutes to
isolate the membrane fraction, and the membrane pellet is washed once with TEE and
resuspended in binding buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl2,
1 mM EDTA). The resuspended membranes can be frozen in liquid nitrogen and
stored at -70°C until use.

Aliquots of cell membranes prepared as described above and stored at -70°C
are thawed, homogenized, and diluted into buffer containing 20 mM HEPES, 10 mM
MgCl2, 1 mM EDTA, 120 mM NaCl, 10 µM GDP, and 0.2 mM ascorbate, at a
concentration of 10-50 µg/ml. In a final volume of 90 µl, homogenates are incubated
with varying concentrations of candidate modulator compounds or 100 µM GTP for
30 minutes at 30°C and then placed on ice. To each sample, 10 µl guanosine
5'-O-(3-[35S]thio)triphosphate (NEN, 1200 Ci/mmol; [35S]-GTPγS) is added to a
final concentration of 100-200 pM. Samples are incubated at 30°C for an additional 30
minutes; 1 ml of 10 mM HEPES, pH 7.4, 10 mM MgCl2, at 4°C is then added, and the
reaction is stopped by filtration.

Samples are filtered over Whatman GF/B filters and the filters are washed with
20 ml ice-cold 10 mM HEPES, pH 7.4, 10 mM MgCl2. Filters are counted by liquid
scintillation spectroscopy. Nonspecific binding of [35S]-GTPγS is measured in the
presence of 100 µM GTP and subtracted from the total. Compounds are selected that
modulate the amount of [35S]-GTPγS binding in the cells, compared to untransfected
control cells. Activation of receptors by agonists gives up to a five-fold increase in
[35S]-GTPγS binding. This response is blocked by antagonists.
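
By way of illustration only, the arithmetic of this assay can be sketched in Python: specific binding is total counts minus the counts measured with excess unlabeled GTP, and the agonist effect is the ratio of specific binding with and without agonist. The last helper assumes that the 10 µl of radioligand is added to a 90 µl sample, giving 100 µl in total. All names and numbers are hypothetical.

```python
def specific_binding(total_cpm, nonspecific_cpm):
    """Total filter counts minus counts measured with excess unlabeled GTP."""
    return total_cpm - nonspecific_cpm


def fold_over_basal(stimulated_cpm, basal_cpm, nonspecific_cpm):
    """Specific binding with agonist relative to specific binding without."""
    return (specific_binding(stimulated_cpm, nonspecific_cpm)
            / specific_binding(basal_cpm, nonspecific_cpm))


def stock_concentration_pM(final_pM, added_ul=10.0, sample_ul=90.0):
    """Concentration needed in the added aliquot to reach final_pM.

    Assumes the aliquot is simply diluted into the sample volume.
    """
    return final_pM * (sample_ul + added_ul) / added_ul


# Hypothetical counts per minute.
fold = fold_over_basal(5200.0, 1200.0, 200.0)
stock = stock_concentration_pM(100.0)
```

Under the stated volume assumption, a 100-200 pM final concentration corresponds to a 1-2 nM radioligand aliquot.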



F. Arachidonic Acid Release

The activation of NgRs may also potentiate arachidonic acid release in cells,
providing yet another useful assay for modulators of NgR activity. (See, e.g.,
Kanterman et al. (1991) Mol. Pharmacol. 39, 364-369.) For example, CHO cells that
are stably transfected with a NgR expression vector are plated in 24-well plates at a
density of 15,000 cells/well and grown in MEM medium supplemented with 10% fetal
bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 µg/ml streptomycin for 48
hours at 37°C before use. Cells of each well are labeled by incubation with
[3H]-arachidonic acid (Amersham Corp., 210 Ci/mmol) at 0.5 µCi/ml in 1 ml MEM
supplemented with 10 mM HEPES, pH 7.5, and 0.5% fatty-acid-free bovine serum
albumin for 2 hours at 37°C. The cells are then washed twice with 1 ml of the same
buffer.

Candidate modulator compounds are added in 1 ml of the same buffer, either
alone or with 10 µM ATP and the cells are incubated at 37°C for 30 minutes. Buffer
alone and mock-transfected cells are used as controls. Samples (0.5 ml) from each
well are counted by liquid scintillation spectroscopy. Agonists which activate the
receptor will lead to potentiation of the ATP-stimulated release of [3H]-arachidonic
acid. This potentiation is blocked by antagonists.

G. Extracellular Acidification Rate

In yet another assay, the effects of candidate modulators of NgR activity are
assayed by monitoring extracellular changes in pH induced by the test compounds (see,
e.g., Dunlop et al. (1998) J. Pharmacol. Toxicol. Meth. 40, 47-55). In one
embodiment, CHO cells transfected with a NgR expression vector are seeded into 12
mm capsule cups (Molecular Devices Corp.) at 4 x 10^ cells/cup in MEM
supplemented with 10% fetal bovine serum, 2 mM L-glutamine, 10 U/ml penicillin, and
10 µg/ml streptomycin. The cells are incubated in this medium at 37°C in 5% CO2 for
24 hours.

Extracellular acidification rates are measured using a Cytosensor
microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the
sensor chambers of the microphysiometer and the chambers are perfused with running
buffer (bicarbonate-free MEM supplemented with 4 mM L-glutamine, 10 units/ml
penicillin, 10 µg/ml streptomycin, 26 mM NaCl) at a flow rate of 100 µl/minute.
Candidate agonists or other agents are diluted into the running buffer and perfused
through a second fluid path. During each 60-second pump cycle, the pump is run for
38 seconds and is off for the remaining 22 seconds. The pH of the running buffer in
the sensor chamber is recorded during the cycle from 43-58 seconds, and the pump is
re-started at 60 seconds to start the next cycle. The rate of acidification of the running
buffer during the recording time is calculated by the Cytosoft program. Changes in the
rate of acidification are calculated by subtracting the baseline value (the average of 4
rate measurements immediately before addition of a modulator candidate) from the
highest rate measurement obtained after addition of a modulator candidate. The
selected instrument detects 61 mV/pH unit. Modulators that act as agonists of the
receptor result in an increase in the rate of extracellular acidification compared to the
rate in the absence of agonist. This response is blocked by modulators which act as
antagonists of the receptor.
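
By way of illustration only, the rate-change calculation can be sketched in Python. This is not the Cytosoft program. The first function follows the rule stated above (highest post-addition rate minus the mean of the four preceding rates). The second converts a rate using the stated 61 mV/pH sensitivity and assumes, as a hypothetical, that rates are reported in microvolts per second.

```python
def acidification_change(rates, addition_index, n_baseline=4):
    """Highest post-addition rate minus the mean of the preceding rates.

    rates: one acidification rate per pump cycle, in cycle order.
    addition_index: index of the first cycle after the candidate is added.
    """
    if addition_index < n_baseline:
        raise ValueError("not enough baseline cycles before the addition")
    baseline = sum(rates[addition_index - n_baseline:addition_index]) / n_baseline
    return max(rates[addition_index:]) - baseline


def uv_per_s_to_ph_per_min(rate_uv_per_s, mv_per_ph=61.0):
    """Convert a rate in microvolts/second to pH units/minute."""
    return rate_uv_per_s * 60.0 / (mv_per_ph * 1000.0)


# Hypothetical per-cycle rates; the candidate is added before cycle 4.
rates = [100.0, 102.0, 98.0, 100.0, 120.0, 150.0, 130.0]
change = acidification_change(rates, addition_index=4)
```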

Example 15: mNgR3 does not bind hNogo-A(1055-1120)

To functionally test the mouse NgR3 (hereinafter, mNgR3) for its ability to
bind hNogo-A(1055-1120), a cDNA expression vector for a myc epitope-tagged
mNgR3 protein was created. The mouse NgR3 cDNA was amplified by PCR from
mouse adult brain cDNA, from the signal sequence to the stop codon, and ligated into
the pSecTag2 vector such that the vector encodes a signal sequence followed by a myc
tag followed by the mature mNgR3 sequence. This plasmid was transfected into
COS-7 cells, and expression of a myc-tagged protein of the predicted size was verified
by immunoblot analysis. Alkaline phosphatase-hNogo-A(1055-1120) binding studies
and myc immunohistology were conducted as described (Fournier et al., supra).

The cells expressing mNgR3 express the myc-tagged protein but binding to
AP-hNogo-A(1055-1120) was not observed under the conditions employed (Fig. 8).

Example 16: Identification of partial human NgR3 cDNA and protein sequences

The tblastn program was used to search for the human homolog of mouse
NgR3. The mouse NgR3 protein sequence (SEQ ID NO:4) was used to query a
proprietary human expressed sequence tag (EST) database from Incyte yielding one
highly significant hit: Incyte Template ID 190989.1. This sequence (937 nucleotides)
contains an open reading frame of 312 amino acids in the second reverse frame that
exhibits 88% identity with residues 66 to 381 of mouse NgR3 (SEQ ID NO:4),
strongly indicating that it is part of the human NgR3 homolog.
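
By way of illustration only, reading a sequence in a reverse frame can be sketched in Python. The sketch takes "second reverse frame" to mean the reverse complement read from an offset of one nucleotide, which is an assumed numbering convention, and it uses the standard genetic code. The short test sequence is hypothetical and is not the Incyte EST.

```python
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Standard genetic code, built in TCAG order for each codon position.
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}


def reverse_complement(seq):
    """Reverse complement of a DNA string containing A, C, G and T."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq, frame=0):
    """Translate seq starting at a 0-based offset; unknown codons become X."""
    seq = seq.upper()[frame:]
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq) - 2, 3))


def translate_reverse_frame(seq, n):
    """Translate reverse frame n (1, 2 or 3) under the assumed convention."""
    return translate(reverse_complement(seq), frame=n - 1)


# Hypothetical 10-nucleotide example.
peptide = translate_reverse_frame("AGCAACCATT", 2)
```

Under this convention a 937-nucleotide sequence read from an offset of one contains exactly 312 complete codons, which agrees with the open reading frame length reported above.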

A query of SEQ ID NO:4 against the public human EST database in Genbank
also produced a hit with a 465-bp EST (Accession number: R35699; Version number:
R35699.1; GI: 792600). There are a number of single nucleotide deletions and
insertions within this sequence which cause frame shift errors. All of the reliable
sequence contained in this public EST is present in the Incyte EST (Template ID
190989.1).

To obtain more nucleotide sequence that would extend the amino acid
sequence at the carboxy terminal end, the I.M.A.G.E. Consortium clone No. 38319,
which corresponds to Genbank accession No. R35699, was purchased from Incyte
Genomics Inc. and subjected to further DNA sequence analysis. This clone consists of
a NotI/HinD III fragment containing the sequence of interest, cloned into the
NotI/HinD III sites of the vector Lafmid BA
(http://image.llnl.gov/image/html/libs/lafmidBA.shtml). The clone was received as an
agar stab, which was streaked out on LB agar plates containing 50 µg/ml ampicillin to
isolate individual colonies. Six colonies were grown in LB medium with antibiotic, and
plasmid DNA was prepared using the Promega Wizard Plus Miniprep DNA
Purification System (Promega #A7500). These DNAs were subsequently digested
with NotI and HinD III restriction enzymes to confirm that the clones contained an
insert. The insert of one isolate was sequenced using a combination of vector specific
and gene specific primers yielding a partial nucleotide sequence of human NgR3 of
1176 nucleotides (SEQ ID NO:13). A translation of this sequence provides a partial
sequence for human NgR3 of 392 amino acids (SEQ ID NO:14).

The nucleotide sequence of SEQ ID NO:13 differs from the Incyte EST
sequence at three positions. Nucleotide positions 12-13 in SEQ ID NO:13 are CG,
whereas the corresponding nucleotides in the Incyte Template ID 190989.1 are GT
(i.e., positions 12-13 of the complement of Incyte Template ID 190989.1). In
addition, position 641 in SEQ ID NO:13 is a C, whereas the corresponding nucleotide
in the Incyte Template ID 190989.1 sequence is an A (i.e., position 641 of the
complement of Incyte Template ID 190989.1). This results in two changes in amino
acids when comparing SEQ ID NO:14 to the ORF encoded by Incyte Template ID
190989.1: SEQ ID NO:14 contains a valine at position 5, whereas the ORF encoded
by Incyte Template ID 190989.1 contains a leucine; SEQ ID NO:14 contains an
alanine at position 214, whereas the ORF encoded by Incyte Template ID 190989.1
contains a glutamic acid.
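
By way of illustration only, the relation between the nucleotide positions and the amino acid positions can be checked with a short Python sketch. It assumes that translation of SEQ ID NO:13 begins at nucleotide 1, which is consistent with 1176 nucleotides encoding 392 amino acids. The function name is hypothetical.

```python
def codon_for_position(nt_pos):
    """Map a 1-based nucleotide position to (codon number, base within codon).

    Both returned values are 1-based; translation is assumed to start
    at nucleotide 1.
    """
    codon = (nt_pos - 1) // 3 + 1
    base_in_codon = (nt_pos - 1) % 3 + 1
    return codon, base_in_codon


# Positions discussed in the text.
for pos in (12, 13, 641, 1176):
    print(pos, codon_for_position(pos))
```

Under this assumption, position 13 is the first base of codon 5 and position 641 is the second base of codon 214, matching the amino acid differences at positions 5 and 214. Position 12 is the third base of codon 4, which would be consistent with a difference there leaving the encoded amino acid unchanged.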

The nucleotide sequence of SEQ ID NO:13 differs from the public EST
(Accession number: R35699; Version number: R35699.1; GI: 792600) sequence at
two positions (within the first 200 nucleotides of reliable sequence). Nucleotide
positions 12-13 in SEQ ID NO:13 are CG, whereas the corresponding nucleotides in
the public EST are GT (i.e., positions 12-13 of the public EST; Accession no: R35699;
Version no: R35699.1; GI: 792600). This leads to a single amino acid change when
comparing SEQ ID NO:14 to the ORF encoded by the public EST: SEQ ID NO:14
contains a valine at position 5, while the ORF encoded by the public EST contains a
leucine.

A Bestfit analysis of the partial human amino acid sequence with the full-length
mouse amino acid sequence indicates that the human NgR3 amino acid sequence is
complete at the carboxy terminal end and that they share 89.54% identity. An
alignment of all the NgR proteins is shown in Figure 9. Although the human NgR3
amino acid sequence is missing the first 25 amino acids, it can be determined that the
human NgR3 protein contains the following features in common with the other NgR
sequences: (1) eight Leucine Rich Repeat (LRR) domains; (2) an LRR carboxy-terminal
(LRR-CT) domain; (3) a conserved cysteine in the fourth LRR domain; (4) a
conserved potential glycosylation site in the eighth LRR domain; and (5) a
hydrophobic carboxyl terminus.
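
By way of illustration only, a percent identity figure of the kind reported above can be computed from a pairwise alignment as in the following Python sketch. It counts identical residues over aligned, ungapped columns. Alignment programs differ in the denominator they use, so this is an assumed convention and not necessarily the one applied by Bestfit. The aligned strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of ungapped alignment columns holding identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical aligned fragments.
identity = percent_identity("MVA-LK", "MLAGLK")
```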

As those skilled in the art will appreciate, numerous changes and modifications
may be made to the preferred embodiments of the invention without departing from the
spirit of the invention. It is intended that all such variations fall within the scope of the
invention.

The entire disclosure of each publication cited herein is hereby incorporated by
reference. This application claims benefit from United States provisional application
60/238,361, filed October 6, 2000, which is incorporated by reference herein in its
entirety.
Key for Sequence Listing:

SEQ ID NO: 1     human NgR2 cDNA sequence derived from genomic sequence AC013606
SEQ ID NO: 2     human NgR2 amino acid sequence
SEQ ID NO: 3     mouse NgR3 cDNA sequence derived from AC021768
SEQ ID NO: 4     a mouse NgR3 amino acid sequence
SEQ ID NO: 5     a human NgR1 amino acid sequence
SEQ ID NO: 6     a consensus amino acid sequence for NgRs
SEQ ID NO: 7     amino acid residues 1055-1120 of hNogoA (Nogo-66)
SEQ ID NO: 8     a mature human NgR2 amino acid sequence
SEQ ID NO: 9     a mature mouse NgR3 amino acid sequence
SEQ ID NO: 10    a consensus NgR LRRNT amino acid sequence
SEQ ID NO: 11    a consensus NgR LRRCT domain amino acid sequence
SEQ ID NO: 12    a consensus NgR LRR domain amino acid sequence
SEQ ID NO: 13    a partial human NgR3 nucleotide sequence
SEQ ID NO: 14    a partial human NgR3 amino acid sequence
SEQ ID NO: 15    a genomic sequence encoding a human NgR2 sequence
SEQ ID NO: 16    a genomic sequence (complementary strand) encoding a mouse NgR3
SEQ ID NO: 17    a mouse NgR1 amino acid sequence
SEQ ID NO: 18    a consensus sequence for the NTLRRCT domain of NgR
SEQ ID NO: 19    a consensus NgR LRRCT domain amino acid sequence






CLAIMS

What is claimed is: 

1. An isolated nucleic acid comprising a nucleotide sequence encoding
a polypeptide comprising an LRRCT domain consisting of the amino acid sequence:

N X1 W X2 C X3 C R A R X4 L W X5 W X6 X7 X8 X9 R X10 S S S X11 V

X12 C X13 X14 P X15 X16 X17 X18 X19 X20 D L X21 X22 L X23 X24 X25 D

X26 X27 X28 C [SEQ ID NO: 19]

wherein X is any amino acid or a gap and the polypeptide does not comprise
the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 (human NgR1) or
SEQ ID NO: 17 (mouse NgR1).

2. The isolated nucleic acid according to claim 1, wherein X17 and X23
are independently selected from the group consisting of: arginine and lysine.

3. The isolated nucleic acid according to claim 2, wherein the amino
acid sequence of the LRRCT domain is selected from the group consisting of: residues
261-310 of SEQ ID NO: 2 and residues 261-310 of SEQ ID NO: 2 with up to 10
conservative amino acid substitutions.

4. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 2.

5. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 4
(mouse NgR3) or SEQ ID NO: 14 (human NgR3).



6. The isolated nucleic acid according to claim 1, wherein the
polypeptide comprises: (a) an NTLRRCT domain, and (b) less than a complete CTS
domain, provided that a partial CTS domain, if present, consists of no more than the
first 39 amino acids of the CTS domain.

7. The isolated nucleic acid according to claim 1, wherein the polypeptide does
not comprise an intact GPI domain.

8. An isolated nucleic acid consisting essentially of a nucleotide
sequence complementary to a nucleotide sequence encoding a polypeptide selected
from the group consisting of: a polypeptide consisting of residues 311-395 of SEQ ID
NO: 2, a polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a
polypeptide consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid
is from 8 to 100 nucleotides in length.

9. A vector comprising the nucleic acid of any one of claims 1, 4 or 5.

10. A host cell comprising a vector according to claim 9.

11. A polypeptide comprising a LRRCT amino acid sequence:

N X1 W X2 C X3 C R A R X4 L W X5 W X6 X7 X8 X9 R X10 S S S X11 V

X12 C X13 X14 P X15 X16 X17 X18 X19 X20 D L X21 X22 L X23 X24 X25 D

X26 X27 X28 C [SEQ ID NO: 19]

wherein X is any amino acid residue or a gap and the polypeptide does not
comprise the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5
(human NgR1) or SEQ ID NO: 17 (mouse NgR1).

12. The polypeptide according to claim 11, wherein X17 and X23 are
selected from the group consisting of arginine and lysine.

13. The polypeptide according to claim 11, wherein X19 is glycine
[SEQ ID NO: 11].

14. The polypeptide according to claim 11, wherein the amino acid
sequence is selected from the group consisting of residues 261-310 of SEQ ID NO: 2,
residues 206-255 of SEQ ID NO: 14, residues 271-320 of SEQ ID NO: 4 and amino
acid sequences thereof comprising a conservative substitution.

15. A polypeptide comprising a NTLRRCT amino acid sequence:

C P X1 X2 C X3 C Y X4 X5 P X6 X7 T X8 S C X9 X10 X11 X12 X13 X14 X15 X16 P
X17 X18 X19 P X20 X21 X22 X23 R X24 F L X25 X26 N X27 I X28 X29 X30 X31 X32 X33
X34 F X35 X36 X37 X38 X39 X40 X41 X42 L W X43 X44 S N X45 X46 X47 X48 I X49
X50 X51 X52 F X53 X54 X55 X56 X57 L E X58 L D L X59 D N X60 X61 L X62 X63 X64
X65 P X66 T F X67 G L X68 X69 L X70 X71 L X72 L X73 X74 C X75 L X76 X77 L X78
X79 X80 X81 F X82 G L X83 X84 L Q Y L Y L Q X85 N X86 X87 X88 X89 L X90 D
X91 X92 F X93 D L X94 N L X95 H L F L H G N X96 X97 X98 X99 X100 X101 X102
X103 X104 F R G L X105 X106 L D R L L L H X107 N X108 X109 X110 X111 V H X112
X113 A F X114 X115 L X116 R L X117 X118 L X119 L F X120 N X121 L X122 X123 L
X124 X125 X126 X127 L X128 X129 L X130 X131 L X132 X133 L R L N X134 N X135 W
X136 C X137 C R X138 R X139 L W X140 W X141 X142 X143 X144 R X145 S S S X146
V X147 C X148 X149 P X150 X151 X152 X153 X154 X155 D L X156 X157 L X158 X159 X160
D X161 X162 X163 C [SEQ ID NO: 18]

wherein X is any amino acid residue or a gap and wherein the polypeptide is
not the polypeptide of SEQ ID NO: 5 (human NgR1) or SEQ ID NO: 17
(mouse NgR1).

16. The polypeptide according to claim 15, wherein X5, X37 and X38
represents a gap.

17. A polypeptide comprising an amino acid sequence selected from the
group consisting of: SEQ ID NO: 2, SEQ ID NO: 4 and SEQ ID NO: 14.

18. The polypeptide according to any one of claims 11, 15 or 17, wherein
the polypeptide comprises: (a) an NTLRRCT domain, and (b) less than a complete
CTS domain, provided that a partial CTS domain, if present, consists of no more than
the first 39 amino acids of the CTS domain.

19. The polypeptide according to any one of claims 11, 15 or 17,
wherein the polypeptide does not comprise an intact GPI domain.

20. The polypeptide according to any one of claims 11, 15 or 17,
wherein the amino acid sequence of the polypeptide further comprises an amino acid
sequence of a heterologous polypeptide.

21. The polypeptide according to claim 20, wherein the heterologous
polypeptide is an Fc portion of an antibody.

22. A method of producing a polypeptide according to any one of
claims 11, 15 or 17, comprising the steps of introducing an isolated nucleic acid
according to any one of claims 1, 4, 5 or 8 or a vector according to claim 9 into a host
cell, culturing said host cell under conditions suitable for expression of said
polypeptide, and recovering said polypeptide.

23. An antibody that binds to a polypeptide of any one of claims 11, 15
or 17.

24. A composition comprising the polypeptide of claim 11, 15 or 17
and a pharmaceutically acceptable carrier.

25. A composition comprising the antibody of claim 23 and a
pharmaceutically acceptable carrier.

26. A method of decreasing inhibition of axonal growth of a CNS
neuron, comprising the step of contacting the neuron with an effective amount of the
polypeptide of claim 11, 15 or 17.

27. A method of treating a central nervous system disease, disorder or
injury, comprising administering to a mammal an effective amount of the polypeptide
of claim 11, 15 or 17.

28. A method of decreasing inhibition of axonal growth of a CNS
neuron comprising the step of contacting the neuron with an effective amount of the
antibody according to claim 23.

29. A method of treating a central nervous system disease, disorder or
injury, comprising administering to a mammal an effective amount of the antibody
according to claim 23.

30. A method for identifying a molecule that binds a polypeptide of
claim 11, 15 or 17 comprising the steps of:
(a) providing a polypeptide of claim 11, 15 or 17;
(b) contacting the polypeptide with the candidate molecule; and
(c) detecting binding of the candidate molecule to the polypeptide.
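
For illustration only, the LRRCT consensus recited in claims 1 and 11 can be treated as a pattern in which each X matches any single amino acid or a gap, and a candidate peptide can be tested against it. The Python sketch below is a minimal example under stated assumptions: the consensus string is a transcription of the claim text, the helper names are hypothetical, and the test peptide is residues 261-310 of SEQ ID NO: 2 written in one-letter code. It does not check the exclusion of the NgR1 sequences recited in the claims.

```python
import re

# LRRCT consensus as transcribed from claims 1 and 11 (an assumption of this
# sketch); each X stands for any single amino acid or a gap.
LRRCT_CONSENSUS = (
    "N X W X C X C R A R X L W X W X X X X R X S S S X V "
    "X C X X P X X X X X X D L X X L X X X D "
    "X X X C"
)


def consensus_to_regex(consensus):
    """Compile a space-separated consensus into a regular expression."""
    parts = []
    for token in consensus.split():
        # "X" = any residue or a gap (zero residues); other tokens are literal.
        parts.append("[A-Z]?" if token == "X" else re.escape(token))
    return re.compile("".join(parts))


def fits_consensus(peptide, consensus=LRRCT_CONSENSUS):
    """Return True if the whole peptide can be read as the consensus."""
    return consensus_to_regex(consensus).fullmatch(peptide.upper()) is not None


# Residues 261-310 of SEQ ID NO: 2, written in one-letter code.
NGR2_261_310 = "NPWACDCRARPLWAWFQRARVSSSDVTCATPPERQGRDLRALREADFQAC"

if __name__ == "__main__":
    print(fits_consensus(NGR2_261_310))
```

Modelling a gap as an optional character keeps the sketch short; a stricter check would align the peptide to the consensus first and then compare column by column.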
SEQUENCE LISTING



<110> BIOGEN, INC. 

YALE UNIVERSITY 
STRITTMATTER, STEPHEN M. 
CATE, RICHARD L.
SAH, DINAH W.Y.

<120> NOGO RECEPTOR HOMOLOGS 

<130> A116PCT 

<140> 
<141> 

<150> 60/238,361 
<151> 2000-10-06 

<160> 16 

<170> PatentIn Ver. 2.1

<210> 1

<211> 1260 

<212> DNA 

<213> Homo sapiens 

<400> 1 

atgctgcccg ggctcaggcg cctgctgcaa gctcccgcct cggcctgcct cctgctgatg 60 
ctcctggccc tgcccctggc ggcccccagc tgccccatgc tctgcacctg ctactcatcc 120 
ccgcccaccg tgagctgcca ggccaacaac ttctcctctg tgccgctgtc cctgccaccc 180 
agcactcagc gactcttcct gcagaacaac ctcatccgca cgctgcggcc aggcaccttt 240 
gggtccaacc tgctcaccct gtggctcttc tccaacaacc tctccaccat ctacccgggc 300 
actttccgcc acttgcaagc cctggaggag ctggacctcg gtgacaaccg gcacctgcgc 360 
tcgctggagc ccgacacctt ccagggcctg gagcggctgc agtcgctgca tttgtaccgc 420 
tgccagctca gcagcctgcc cggcaacatc ttccgaggcc tggtcagcct gcagtacctc 480 
tacctccagg agaacagcct gctccaccta caggatgact tgttcgcgga cctggccaac 540 
ctgagccacc tcttcctcca cgggaaccgc ctgcggctgc tcacagagca cgtgtttcgc 600 
ggcctgggca gcctggaccg gctgctgctg cacgggaacc ggctgcaggg cgtgcaccgc 660 
gcggccttcc gcggcctcag ccgcctcacc atcctctacc tgttcaacaa cagcctggcc 720 
tcgctgcccg gcgaggcgct cgccgacctg ccctcgctcg agttcctgcg gctcaacgct 780 
aacccctggg cgtgcgactg ccgcgcgcgg ccgctctggg cctggttcca gcgcgcgcgc 840 
gtgtccagct ccgacgtgac ctgcgccacc cccccggagc gccagggccg agacctgcgc 900 
gcgctccgcg aggccgactt ccaggcgtgt ccgcccgcgg cacccacgcg gccgggcagc 960 
cgcgcccgcg gcaacagctc ctccaaccac ctgtacgggg tggccgaggc cggggcgccc 1020 
ccagccgatc cctccaccct ctaccgagat ctgcctgccg aagactcgcg ggggcgccag 1080 
ggcggggacg cgcctactga ggacgactac tgggggggct acgggggtga ggaccagcga 1140 
ggggagcaga tgtgccccgg cgctgcctgc caggcgcccc cggactcccg aggccctgcg 1200 
ctctcggccg ggctccccag ccctctgctt tgcctcctgc tcctggtgcc ccaccacctc 1260 



<210> 2 
<211> 420 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Leu Pro Gly Leu Arg Arg Leu Leu Gln Ala Pro Ala Ser Ala Cys
1 5 10 15

Leu Leu Leu Met Leu Leu Ala Leu Pro Leu Ala Ala Pro Ser Cys Pro
20 25 30

Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys Gln Ala
35 40 45

Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr Gln Arg
50 55 60

Leu Phe Leu Gln Asn Asn Leu Ile Arg Thr Leu Arg Pro Gly Thr Phe
65 70 75 80

Gly Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu Ser Thr
85 90 95

Ile Tyr Pro Gly Thr Phe Arg His Leu Gln Ala Leu Glu Glu Leu Asp
100 105 110

Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr Phe Gln
115 120 125

Gly Leu Glu Arg Leu Gln Ser Leu His Leu Tyr Arg Cys Gln Leu Ser
130 135 140

Ser Leu Pro Gly Asn Ile Phe Arg Gly Leu Val Ser Leu Gln Tyr Leu
145 150 155 160

Tyr Leu Gln Glu Asn Ser Leu Leu His Leu Gln Asp Asp Leu Phe Ala
165 170 175

Asp Leu Ala Asn Leu Ser His Leu Phe Leu His Gly Asn Arg Leu Arg
180 185 190

Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp Arg Leu
195 200 205

Leu Leu His Gly Asn Arg Leu Gln Gly Val His Arg Ala Ala Phe Arg
210 215 220

Gly Leu Ser Arg Leu Thr Ile Leu Tyr Leu Phe Asn Asn Ser Leu Ala
225 230 235 240

Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu Phe Leu
245 250 255

Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg Pro Leu
260 265 270

Trp Ala Trp Phe Gln Arg Ala Arg Val Ser Ser Ser Asp Val Thr Cys
275 280 285

Ala Thr Pro Pro Glu Arg Gln Gly Arg Asp Leu Arg Ala Leu Arg Glu
290 295 300

Ala Asp Phe Gln Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro Gly Ser
305 310 315 320

Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val Ala Glu
325 330 335

Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp Leu Pro
340 345 350

Ala Glu Asp Ser Arg Gly Arg Gln Gly Gly Asp Ala Pro Thr Glu Asp
355 360 365

Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gln Arg Gly Glu Gln Met
370 375 380

Cys Pro Gly Ala Ala Cys Gln Ala Pro Pro Asp Ser Arg Gly Pro Ala
385 390 395 400

Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu Leu Val
405 410 415

Pro His His Leu
420



<210> 3 
<211> 1383 
<212> DNA 
<213> Mus sp. 

<400> 3 

atgtcttggc agtctggaac cacagtgaca caatctcccg tgcaggctgc tcaggtctca 60

gggtgctgtg tggaattgct gctgttgctg ctcgctggag agctacctct gggtggtggt 120 

tgtcctcgag actgtgtgtg ctaccctgcg cccatgactg tcagctgcca ggcacacaac 180 

tttgctgcca tcccggaggg catcccagag gacagtgagc gcatcttcct gcagaacaat 240 

cgcatcacct tcctccagca gggccacttc agccccgcca tggtcaccct ctggatctac 300 

tccaacaaca tcactttcat tgctcccaac accttcgagg gctttgtgca tctggaggag 360 

ctagaccttg gagacaaccg acagctgcga acgctggcac ccgagacctt ccaaggcctg 420 

gtgaagcttc acgccctcta cctctataag tgtggactga gcgccctgcc cgcaggcatc 480 

tttggtggcc tgcacagcct gcagtatctc tacttgcagg acaaccatat cgagtacctc 540

caagatgaca tctttgtgga cctggtcaat ctcagtcact tgtttctcca tggtaacaag 600 

ctatggagcc tgggccaagg catcttccgg ggcctggtga acctggaccg gttgctgctg 660 

catgagaacc agctacagtg ggttcaccac aaggctttcc atgacctcca caggctaacc 720 

accctctttc tcttcaacaa cagcctcact gagctgcagg gtgactgtct ggcccccctg 780 

gtggccttgg agttccttcg cctcaatggg aatgcttggg actgtggctg ccgggcacgt 840 

tccctgtggg aatggctgcg aaggttccgt ggctctagct ctgctgtccc ctgcgcgacc 900 

cccgagctgc ggcaaggcca ggatctgaag ctgctgaggg tggaggactt ccggaactgc 960 

acaggaccag tgtctcctca ccagatcaag tctcacacgc ttaccacctc tgacagggct 1020 

gcccgcaagg agcaccatcc gtcccatggg gcctccaggg acaaaggcca cccacatggc 1080 

catccgcctg gctccaggtc aggttacaag aaggcaggca agaactgcac cagccacagg 1140 

aaccggaacc agatctctaa ggtgagctct gggaaagagc ttaccgaact gcaggactat 1200 
gcccccgact atcagcacaa gttcagcttt gacatcatgc ccaccgcacg acccaagagg 1260 
aagggcaagt gtgctcgcag gacccccatc cgtgccccca gtggggtgca gcaggcatcc 1320 
tcaggcacgg cccttggggc cccactcctg gcctggatac tggggctggc agtcactctc 1380 
cgc 1383

<210> 4 
<211> 461 
<212> PRT 
<213> Mus sp. 

<400> 4 

Met Ser Trp Gln Ser Gly Thr Thr Val Thr Gln Ser Pro Val Gln Ala
1 5 10 15

Ala Gln Val Ser Gly Cys Cys Val Glu Leu Leu Leu Leu Leu Leu Ala
20 25 30

Gly Glu Leu Pro Leu Gly Gly Gly Cys Pro Arg Asp Cys Val Cys Tyr
35 40 45

Pro Ala Pro Met Thr Val Ser Cys Gln Ala His Asn Phe Ala Ala Ile
50 55 60

Pro Glu Gly Ile Pro Glu Asp Ser Glu Arg Ile Phe Leu Gln Asn Asn
65 70 75 80

Arg Ile Thr Phe Leu Gln Gln Gly His Phe Ser Pro Ala Met Val Thr
85 90 95

Leu Trp Ile Tyr Ser Asn Asn Ile Thr Phe Ile Ala Pro Asn Thr Phe
100 105 110

Glu Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gln
115 120 125

Leu Arg Thr Leu Ala Pro Glu Thr Phe Gln Gly Leu Val Lys Leu His
130 135 140

Ala Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly Ile
145 150 155 160

Phe Gly Gly Leu His Ser Leu Gln Tyr Leu Tyr Leu Gln Asp Asn His
165 170 175

Ile Glu Tyr Leu Gln Asp Asp Ile Phe Val Asp Leu Val Asn Leu Ser
180 185 190

His Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Gln Gly Ile
195 200 205

Phe Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gln
210 215 220

Leu Gln Trp Val His His Lys Ala Phe His Asp Leu His Arg Leu Thr
225 230 235 240

Thr Leu Phe Leu Phe Asn Asn Ser Leu Thr Glu Leu Gln Gly Asp Cys
245 250 255

Leu Ala Pro Leu Val Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Ala
260 265 270

Trp Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Arg Arg
275 280 285

Phe Arg Gly Ser Ser Ser Ala Val Pro Cys Ala Thr Pro Glu Leu Arg
290 295 300

Gln Gly Gln Asp Leu Lys Leu Leu Arg Val Glu Asp Phe Arg Asn Cys
305 310 315 320

Thr Gly Pro Val Ser Pro His Gln Ile Lys Ser His Thr Leu Thr Thr
325 330 335

Ser Asp Arg Ala Ala Arg Lys Glu His His Pro Ser His Gly Ala Ser
340 345 350

Arg Asp Lys Gly His Pro His Gly His Pro Pro Gly Ser Arg Ser Gly
355 360 365

Tyr Lys Lys Ala Gly Lys Asn Cys Thr Ser His Arg Asn Arg Asn Gln
370 375 380

Ile Ser Lys Val Ser Ser Gly Lys Glu Leu Thr Glu Leu Gln Asp Tyr
385 390 395 400

Ala Pro Asp Tyr Gln His Lys Phe Ser Phe Asp Ile Met Pro Thr Ala
405 410 415

Arg Pro Lys Arg Lys Gly Lys Cys Ala Arg Arg Thr Pro Ile Arg Ala
420 425 430

Pro Ser Gly Val Gln Gln Ala Ser Ser Gly Thr Ala Leu Gly Ala Pro
435 440 445

Leu Leu Ala Trp Ile Leu Gly Leu Ala Val Thr Leu Arg
450 455 460



<210> 5 

<211> 473 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Lys Arg Ala Ser Ala Gly Gly Ser Arg Leu Leu Ala Trp Val Leu
1 5 10 15

Trp Leu Gln Ala Trp Gln Val Ala Ala Pro Cys Pro Gly Ala Cys Val
20 25 30

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gln Gln Gly Leu
35 40 45

Gln Ala Val Pro Val Gly Ile Pro Ala Ala Ser Gln Arg Ile Phe Leu
50 55 60

His Gly Asn Arg Ile Ser His Val Pro Ala Ala Ser Phe Arg Ala Cys
65 70 75 80

Arg Asn Leu Thr Ile Leu Trp Leu His Ser Asn Val Leu Ala Arg Ile
85 90 95

Asp Ala Ala Ala Phe Thr Gly Leu Ala Leu Leu Glu Gln Leu Asp Leu
100 105 110

Ser Asp Asn Ala Gln Leu Arg Ser Val Asp Pro Ala Thr Phe His Gly
115 120 125

Leu Gly Arg Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Gln Glu
130 135 140

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gln Tyr Leu Tyr
145 150 155 160

Leu Gln Asp Asn Ala Leu Gln Ala Leu Pro Asp Asp Thr Phe Arg Asp
165 170 175

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg Ile Ser Ser
180 185 190

Val Pro Glu Arg Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu
195 200 205

Leu His Gln Asn Arg Val Ala His Val His Pro His Ala Phe Arg Asp
210 215 220

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Ala
225 230 235 240

Leu Pro Thr Glu Ala Leu Ala Pro Leu Arg Ala Leu Gln Tyr Leu Arg
245 250 255

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp
260 265 270

Ala Trp Leu Gln Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Ser
275 280 285

Leu Pro Gln Arg Leu Ala Gly Arg Asp Leu Lys Arg Leu Ala Ala Asn
290 295 300

Asp Leu Gln Gly Cys Ala Val Ala Thr Gly Pro Tyr His Pro Ile Trp
305 310 315 320

Thr Gly Arg Ala Thr Asp Glu Glu Pro Leu Gly Leu Pro Lys Cys Cys
325 330 335

Gln Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro
340 345 350

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Ser
355 360 365

Pro Pro Gly Asn Gly Ser Gly Pro Arg His Ile Asn Asp Ser Pro Phe
370 375 380

Gly Thr Leu Pro Gly Ser Ala Glu Pro Pro Leu Thr Ala Val Arg Pro
385 390 395 400

Glu Gly Ser Glu Pro Pro Gly Phe Pro Thr Ser Gly Pro Arg Arg Arg
405 410 415

Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly
420 425 430

Gln Ala Gly Ser Gly Gly Gly Gly Thr Gly Asp Ser Glu Gly Ser Gly
435 440 445

Ala Leu Pro Ser Leu Thr Cys Ser Leu Thr Pro Leu Gly Leu Ala Leu
450 455 460

Val Leu Trp Thr Val Leu Gly Pro Cys
465 470



<210> 6 
<211> 440 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 

<222> (3)..(4)

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (9)..(10)
<223> Variable amino acid
<220> 

<221> MOD_RES 

<222> (12).. (13) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (18) . . (25) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (27).. (29) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (31).. (34) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (36) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (39) . . (40) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (42) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (44).. (50) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (52).. (59) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (62).. (63) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (66) .. (69) 

<223> Variable amino acid 

<220> 

<221> MOD_RES
<222> (71)..(74)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (76)..(80)

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (83) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (87) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (90)..(91)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (94)..(96)

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (98) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (101) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (104).. (105) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (107) 

<223> Variable amino acid 
<220> 

<221> MOD_RES
<222> (109) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (111).. (112) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 
<220> 
<221> MOD_RES

<222> (116).. (117) 

<223> Variable amino acid 

<220> 

<221> MOD_RES

<222> (119) . . (122) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (124) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (127) . . (128) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (136) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (138) . . (141) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (143) 

<223> Variable amino acid

<220> 

<221> MOD_RES 
<222> (146) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (148) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (151) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (154) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (162) . . (170) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (175)..(176)

<223> Variable amino acid 
<220>

<221> MOD_RES 
<222> (184) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (186) . . (189) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192) . . (193) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (196) . . (197) 

<223> Variable amino acid 

<220> 

<221> MOD_RES
<222> (199) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (202) . . (203) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (205) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (208) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (210) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (212) . . (213) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (215) . . (218) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (221)

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (223) . . (224) 

<223> Variable amino acid 
<220>

<221> MOD_RES 

<222> (226).. (227) 

<223> Variable amino acid 

<220> 

<221> MOD_RES
<222> (232) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (234) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (236) 

<223> Variable amino acid 
<220> 

<221> MOD_RES
<222> (238)

<223> Variable amino acid

<220> 

<221> MOD_RES 
<222> (243) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (246) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (248).. (251) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (253) 

<223> Variable amino acid 
<220> 

<221> MOD_RES
<222> (257) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (259) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (261)..(262)

<223> Variable amino acid 

<220> 

<221> MOD_RES

<222> (264)..(267)

<223> Variable amino acid 
<220>

<221> MOD_RES 
<222> (269) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (272) . . (273) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (275) . . (277) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (279) . . (281) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (283)..(287)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (289) . . (290) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (292) . . (328) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (330)..(341)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (344) . . (346) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (348)..(399)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (401) . . (428) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (431) . . (439) 

<223> Variable amino acid 

<400> 6 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser
1 5 10 15

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Pro Xaa Xaa
20 25 30

Xaa Xaa Arg Xaa Phe Leu Xaa Xaa Asn Xaa Ile Xaa Xaa Xaa Xaa Xaa
35 40 45

Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser
50 55 60

Asn Xaa Xaa Xaa Xaa Ile Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa
65 70 75 80

Leu Glu Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa 
85 90 95 

Pro Xaa Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa 
100 105 110 

Cys Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa 

115 120 125 

Leu Gln Tyr Leu Tyr Leu Gln Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp
130 135 140 

Asp Xaa Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly 
145 150 155 160 

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa 
165 170 175 

Leu Asp Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa 
180 185 190 

Xaa Ala Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa 
195 200 205 

Asn Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa 

210 215 220 

Leu Xaa Xaa Leu Arg Leu Asn Xaa Asn Xaa Trp Xaa Cys Xaa Cys Arg 
225 230 235 240 

Ala Arg Xaa Leu Trp Xaa Trp Xaa Xaa Xaa Xaa Arg Xaa Ser Ser Ser 
245 250 " 255 

Xaa Val Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa Gly Xaa Asp Leu Xaa 

260 265 270 

Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Pro 
275 280 285 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

305 310 315 320 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
325 330 335 

Xaa Xaa Xaa Xaa Xaa Pro Pro Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa 
340 345 350 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
355 360 365

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
370 375 380

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg
385 390 395 400

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
405 410 415

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa
420 425 430

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu
435 440



<210> 7 

<211> 66 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Arg Ile Tyr Lys Gly Val Ile Gln Ala Ile Gln Lys Ser Asp Glu Gly
1 5 10 15

His Pro Phe Arg Ala Tyr Leu Glu Ser Glu Val Ala Ile Ser Glu Glu
20 25 30

Leu Val Gln Lys Tyr Ser Asn Ser Ala Leu Gly His Val Asn Cys Thr
35 40 45

Ile Lys Glu Leu Arg Arg Leu Phe Leu Val Asp Asp Leu Val Asp Ser
50 55 60

Leu Lys
65



<210> 8 

<211> 390 

<212> PRT 

<213> Homo sapiens

<400> 8 

Cys Pro Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys
1 5 10 15 

Gin Ala Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr 
20 25 30 

Gin Arg Leu Phe Leu Gin Asn Asn Leu He Arg Thr Leu Arg Pro Gly 
35 40 45 

Thr Phe Gly Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu 
50 55 60 

Ser Thr He Tyr Pro Gly Thr Phe Arg His Leu Gin Ala Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr 
85 90 95 
Phe Gln Gly Leu Glu Arg Leu Gln Ser Leu His Leu Tyr Arg Cys Gln
100 105 110 

Leu Ser Ser Leu Pro Gly Asn He Phe Arg Gly Leu Val Ser Leu Gin 
115 120 125 

Tyr Leu Tyr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu 
130 135 140 

Phe Ala Asp Leu Ala Asn Leu Ser His Leu Phe Leu His Gly Asn Arg
145 150 155 160 

Leu Arg Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp 
165 170 175 

Arg Leu Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala 
180 185 190 

Phe Arg Gly Leu Ser Arg Leu Thr He Leu Tyr Leu Phe Asn Asn Ser 
195 200 205 

Leu Ala Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu 
210 215 220 

Phe Leu Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg 
225 230 235 240 

Pro Leu Trp Ala Trp Phe Gin Arg Ala Arg Val Ser Ser Ser Asp Val 
245 250 255 

Thr Cys Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu 
260 265 270 

Arg Glu Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro 
275 280 285 

Gly Ser Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val 
290 295 300 

Ala Glu Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp 
305 310 315 320 

Leu Pro Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr 
325 330 335 

Glu Asp Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu 
340 345 350 

Gin Met Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly 
355 360 365 

Pro Ala Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu 
370 375 380 

Leu Val Pro His His Leu 
385 390 



<210> 9 
<211> 421 
<212> PRT 
<213> Mus sp. 

<400> 9 
Cys Pro Arg Asp Cys Val Cys Tyr Pro Ala Pro Met Thr Val Ser Cys
1 5 10 15

Gin Ala His Asn Phe Ala Ala He Pro Glu Gly He Pro Glu Asp Ser 
20 25 30 

Glu Arg He Phe Leu Gin Asn Asn Arg He Thr Phe Leu Gin Gin Gly 
35 40 45 

His Phe Ser Pro Ala Met Val Thr Leu Trp He Tyr Ser Asn Asn He 
50 55 60 

Thr Phe He Ala Pro Asn Thr Phe Glu Gly Phe Val His Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg Gin Leu Arg Thr Leu Ala Pro Glu Thr 
85 90 95 

Phe Gin Gly Leu Val Lys Leu His Ala Leu Tyr Leu Tyr Lys Cys Gly 
100 105 110 

Leu Ser Ala Leu Pro Ala Gly He Phe Gly Gly Leu His Ser Leu Gin 
115 120 125 

Tyr Leu Tyr Leu Gin Asp Asn His He Glu Tyr Leu Gin Asp Asp He 
130 135 140 

Phe Val Asp Leu Val Asn Leu Ser His Leu Phe Leu His Gly Asn Lys 
145 150 155 160

Leu Trp Ser Leu Gly Gin Gly He Phe Arg Gly Leu Val Asn Leu Asp 
165 170 175 

Arg Leu Leu Leu His Glu Asn Gin Leu Gin Trp Val His His Lys Ala 
180 185 190

Phe His Asp Leu His Arg Leu Thr Thr Leu Phe Leu Phe Asn Asn Ser 
195 200 205 

Leu Thr Glu Leu Gin Gly Asp Cys Leu Ala Pro Leu Val Ala Leu Glu 
210 215 220 

Phe Leu Arg Leu Asn Gly Asn Ala Trp Asp Cys Gly Cys Arg Ala Arg 
225 230 235 240 

Ser Leu Trp Glu Trp Leu Arg Arg Phe Arg Gly Ser Ser Ser Ala Val 

245 250 255 

Pro Cys Ala Thr Pro Glu Leu Arg Gin Gly Gin Asp Leu Lys Leu Leu 
260 265 270 

Arg Val Glu Asp Phe Arg Asn Cys Thr Gly Pro Val Ser Pro His Gin 
275 280 285 

He Lys Ser His Thr Leu Thr Thr Ser Asp Arg Ala Ala Arg Lys Glu 
290 295 300 

His His Pro Ser His Gly Ala Ser Arg Asp Lys Gly His Pro His Gly 
305 310 315 320 

His Pro Pro Gly Ser Arg Ser Gly Tyr Lys Lys Ala Gly Lys Asn Cys
325 330 335

Thr Ser His Arg Asn Arg Asn Gln Ile Ser Lys Val Ser Ser Gly Lys
340 345 350

Glu Leu Thr Glu Leu Gin Asp Tyr Ala Pro Asp Tyr Gin His Lys Phe 
355 360 365 

Ser Phe Asp lie Met Pro Thr Ala Arg Pro Lys Arg Lys Gly Lys Cys 
370 375 380 

Ala Arg Arg Thr Pro lie Arg Ala Pro Ser Gly Val Gin Gin Ala Ser 
385 390 395 400 

Ser Gly Thr Ala Leu Gly Ala Pro Leu Leu Ala Trp He Leu Gly Leu 
405 410 415 

Ala Val Thr Leu Arg 
420 



<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 

<222> (3).. (4) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (9) . . (10) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (12) . . (13) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid
<400> 10 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
1 5 10 15

Cys 



<210> 11 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (4) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (11) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (14) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (16) . . (19) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (21) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (25) 

<223> Variable amino acid 

<220> 

<221> MOD_RES
<222> (27) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (29).. (30) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (32) . . (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (37) 

<223> Variable amino acid 
<220>

<221> MOD_RES 

<222> (40).. (41) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (43).. (45) 

<223> Variable amino acid 



<220> 

<221> MOD_RES

<222> (47)..(49)

<223> Variable amino acid

<400> 11

Asn Xaa Trp Xaa Cys Xaa Cys Arg Ala Arg Xaa Leu Trp Xaa Trp Xaa 
1 5 10 15 

Xaa Xaa Xaa Arg Xaa Ser Ser Ser Xaa Val Xaa Cys Xaa Xaa Pro Xaa 
20 25 30 

Xaa Xaa Xaa Gly Xaa Asp Leu Xaa Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa 
35 40 45 



Xaa Cys 
50 



<210> 12 
<211> 196 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES
<222> (2) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (5).. (6) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (8) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (10).. (16) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (18).. (25) 

<223> Variable amino acid 

<220> 
<221> MOD_RES

<222> (28)..(29)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (32).. (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES

<222> (37)..(40)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (42).. (46) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (49) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (53) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (56)..(57)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (60).. (62) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (64) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (67) 

<223> Variable amino acid 

<220> 

<221> MOD_RES

<222> (70)..(71)

<223> Variable amino acid 

<220> 

<221> MOD_RES
<222> (73) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (75) 

<223> Variable amino acid 
<220>

<221> MOD_RES 

<222> (77) . . (78) 

<223> Variable amino acid 

<220> 

<221> MOD_RES
<222> (80) 

<223> Variable amino acid 

<220> 

<221> MOD_RES

<222> (82)..(83)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (85) . . (88) 

<223> Variable amino acid 

<220> 

<221> MOD_RES
<222> (90) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (93)..(94)

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (102) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (104).. (107) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (109) 

<223> Variable amino acid 
<220> 

<221> MOD_RES
<222> (112) 

<223> Variable amino acid 



<220> 

<221> MOD_RES
<222> (114)

<223> Variable amino acid



<220> 

<221> MOD_RES 
<222> (117) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (120) 

<223> Variable amino acid 
<220>

<221> MOD_RES 

<222> (128) . . (136) 

<223> Variable amino acid

<220> 

<221> MOD_RES 

<222> (141) . . (142) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (150) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (152)..(155)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (158) . . (159) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (162)..(163)

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (165) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (168)..(169)

<223> Variable amino acid 

<220> 

<221> MOD_RES
<222> (171) 

<223> Variable amino acid 



<220>

<221> MOD_RES
<222> (174)

<223> Variable amino acid

<220>

<221> MOD_RES
<222> (176)

<223> Variable amino acid



<220> 

<221> MOD_RES 

<222> (178) . . (179) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (181).. (184) 

<223> Variable amino acid 
<220>

<221> MOD_RES 
<222> (187) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (189)..(190)

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192)..(193)

<223> Variable amino acid 

<400> 12 

Arg Xaa Phe Leu Xaa Xaa Asn Xaa Ile Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15

Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser Asn Xaa
20 25 30

Xaa Xaa Xaa Ile Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Glu
35 40 45

Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa Pro Xaa
50 55 60

Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa Cys Xaa
65 70 75 80

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa Leu Gln
85 90 95

Tyr Leu Tyr Leu Gln Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp Asp Xaa
100 105 110 

Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly Asn Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa Leu Asp 
130 135 140 

Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa Xaa Ala 
145 150 155 160 

Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa Asn Xaa 
165 170 175 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa Leu Xaa 
180 185 190 

Xaa Leu Arg Leu 
195 



<210> 13 
<211> 1176 
<212> DNA 

<213> Homo sapiens 

<400> 13 

gagggcatcc ccgtggacag cgagcgcgtc ttcctgcaga acaaccgcat cggcctcctc 60 
cagcccggcc acttcagccc cgccatggtc accctgtgga tctactcgaa caacatcacc 120 
tacatccacc ccagcacctt cgagggcttc gtgcacctgg aggagctgga cctcggcgac 180
aaccggcagc tgcggacgct ggcacccgag accttccagg gcctggtgaa gcttcacgcc 240
ctctacctct acaagtgtgg gctcagcgcc ttgccggccg gcgtctttgg cggcctgcac 300
agcctgcagt acctctacct gcaggacaac cacatcgagt acctccagga cgacatcttc 360
gtggacctgg tcaacctcag ccacctgttt ctccacggca acaagctgtg gagtctgggc 420
ccgggcacct tccggggcct ggtgaacctg gaccgtcttt tgctgcacga gaaccagctg 480
cagtgggtcc accacaaggc attccacgac ctccgcaggc tgaccaccct cttcctcttc 540
aacaacagcc tctcggagct gcagggtgag tgcctggccc cgctgggggc cctggagttc 600
ctccgcctca acggcaaccc ctgggactgt ggttgtcgcg cgcgctccct gtgggaatgg 660
ctgcagaggt tccggggctc cagctccgct gtcccctgtg tgtcccctgg gctgcggcac 720
ggccaggacc tgaagctgct gagggccgag gacttccgga actgcacggg accagcgtcc 780
ccgcaccaga tcaagtcaca cacgctcacc accaccgaca gggccgcccg caaggaacac 840
cactcacccc acggccccac caggagcaag ggccacccgc acggcccccg gcccggccac 900
aggaagccgg ggaagaactg caccaacccc aggaaccgca atcagatctc taaggcgggc 960
gccgggaaac aggcccccga gctgccagac tatgccccag actaccagca caagttcagt 1020
tttgacatca tgcctacggc ccggcccaag aggaagggca agtgtgcccg caggaccccc 1080
atccgtgccc ccagcggggt gcagcaggcc tcctcggcca gttccctggg ggcctccctc 1140
ctggcctgga cactggggct ggcggtcact ctccgc 1176



<210> 14 

<211> 392 

<212> PRT 

<213> Homo sapiens 

<400> 14

Glu Gly Ile Pro Val Asp Ser Glu Arg Val Phe Leu Gln Asn Asn Arg
1 5 10 15

Ile Gly Leu Leu Gln Pro Gly His Phe Ser Pro Ala Met Val Thr Leu
20 25 30

Trp Ile Tyr Ser Asn Asn Ile Thr Tyr Ile His Pro Ser Thr Phe Glu
35 40 45

Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gln Leu
50 55 60

Arg Thr Leu Ala Pro Glu Thr Phe Gln Gly Leu Val Lys Leu His Ala
65 70 75 80

Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly Val Phe
85 90 95

Gly Gly Leu His Ser Leu Gln Tyr Leu Tyr Leu Gln Asp Asn His Ile
100 105 110

Glu Tyr Leu Gln Asp Asp Ile Phe Val Asp Leu Val Asn Leu Ser His
115 120 125

Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Pro Gly Thr Phe
130 135 140

Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gln Leu
145 150 155 160

Gln Trp Val His His Lys Ala Phe His Asp Leu Arg Arg Leu Thr Thr
165 170 175

Leu Phe Leu Phe Asn Asn Ser Leu Ser Glu Leu Gln Gly Glu Cys Leu
180 185 190

Ala Pro Leu Gly Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Pro Trp
195 200 205

Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Gln Arg Phe
210 215 220 

Arg Gly Ser Ser Ser Ala Val Pro Cys Val Ser Pro Gly Leu Arg His 

225 230 235 240 

Gly Gln Asp Leu Lys Leu Leu Arg Ala Glu Asp Phe Arg Asn Cys Thr
245 250 255

Gly Pro Ala Ser Pro His Gln Ile Lys Ser His Thr Leu Thr Thr Thr
260 265 270 

Asp Arg Ala Ala Arg Lys Glu His His Ser Pro His Gly Pro Thr Arg 
275 280 285 

Ser Lys Gly His Pro His Gly Pro Arg Pro Gly His Arg Lys Pro Gly 
290 295 300 

Lys Asn Cys Thr Asn Pro Arg Asn Arg Asn Gln Ile Ser Lys Ala Gly
305 310 315 320

Ala Gly Lys Gln Ala Pro Glu Leu Pro Asp Tyr Ala Pro Asp Tyr Gln
325 330 335

His Lys Phe Ser Phe Asp Ile Met Pro Thr Ala Arg Pro Lys Arg Lys
340 345 350

Gly Lys Cys Ala Arg Arg Thr Pro Ile Arg Ala Pro Ser Gly Val Gln
355 360 365

Gln Ala Ser Ser Ala Ser Ser Leu Gly Ala Ser Leu Leu Ala Trp Thr
370 375 380 

Leu Gly Leu Ala Val Thr Leu Arg 
385 390 



<210> 15 

<211> 143899 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modified_base

<222> (2044)..(2144)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base
<222> (6609)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base

<222> (6625) . . (6724) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (14153) . . (14252) 

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base

<222> (19512)..(19611)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (22595) . . (22694) 

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (27825) . . (27924) 

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base

<222> (34953) . . (35052) 

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (40783) . . (40882) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (49000) . . (49099) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (62884)..(62983)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (75528) . . (75627) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (87944) . . (88043) 

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base

<222> (111030) . . (111129) 

<223> a, t, c, g, other or unknown 

<400> 15 

aagcacatac aggtgacatt acagaactga cagttatgcc aggcactgta cttagcccct 60 

ataccatcct caaacagctg tatgatgtag attgggtatt aaccccatta ataacaaaag 120 

tacagggaac aaagtgactt tccaaaggtc atgccattca aaggagggtg aatcttaggt 180 

tggacgcagg ctgtctgact ctggagtctg aggtgttaat gctgcctcct ccatgggaac 240 

agcccaagtg aaaaacagct gatccactct tcatttactt ggcatctgtg ctaagctggt 300 

ccctgagcca agctctgagc aacagaaaca gaagctctgc attaggagct tgtgagcatg 360 

tcaatgccgg gtaaaggagt gctggaaacc gctgggatgg ccgccgagca ctaggccgtt 420 

gaaggtgggc tctgtgtgac tggttcctct acactctggc ctggctgcct gcaggaagaa 480 

gatcaagctg agtgggctgg ccctggacca caaggtgaca ggtgacctct tctacaccca 540 

tgtgaccacc atgggccaga ggctcagcca gaaggccccc agcctggagg acggttcgga 600 

tgccttcatg tcaccccagg atgttcgggg cacctcagaa aaccttcctg agagtgagtg 660 

tctggtcaag gtgccggcct tgggggatag tgatggtggg tcctcatatt cagtgagcac 720 

tcatggttga gtatttattc gcacccctct tcagtcctta caacacccca tgatgtaggt 780 

ggggcatgct cctcatttac agatgggcac atcaaagctc agctaacgct gggaagttca 840 

gattcagggt taccctgctg gattcctggg attggggagg gaggagcttc caaaatgggg 900 
acaaggtctc tgggcctgtc gggtagctgg tttcctcagg gccccttgca acctctgagc 960

ttattgcatc aggtgcagcc aggcccgtga gcctcctggc aggggtcctc cacacctggc 1020 

tgtcttttgc cccctgctgg tcacaggagg agctgcagca cctgcctggg ctgcttctca 1080 

ggagggtaca tgaagatccc aggaccgcca gctccatgat aagtggaagg agctccttgg 1140 

agtcaggagc gggagttgag gagtttgagt cctgctctcc agttataggc tatgtgactt 1200 

gtgtagatca cctaaccttg ctcttgattt ccttacctct taaactagca ctaaaagcac 1260 

cccacaaact gtaaagttag ttgtgatgat tgaatgacac catgggtgtg gaagctcttt 1320 

gtaaagtgca aaacggtgtg cagtttgagg gtggttaccc ccagtgccga ttctcagagg 1380 

gcaacatggc taagggcacg agctggagtt aggctgacct gctgcttcca gccctgtgag 1440 

cttgagcaag tcatttaact tcctgagctg cagtttcctc atcagtaaaa tgtgataagg 1500 

atagggttgt tgtaagattt tattaaatgg ggtaataaat gtcaagtatg tagcccatag 1560 

tgagtgcttc agagtttttt tcttttgttt ctttcccccc cgccccgaga tggagcctta 1620 

ctctgttgcc caggctggag tgcagtggca tgatcttggc tcactgcaac ctccgcctcc 1680 

cgggttcaag caattctcct gcctcagcct cccaaatagc tgggactaca ggcgtgcacc 1740 

accatgctcg gctaattttt gtatctttag tagagacggg gtttcaccat gttggccagg 1800 

ctggtctcga actcctgacc tcatgatgct cctgcctcag cccccgaaag ttttgggatt 1860 

acaagtgtga gcccccgtgc cctgccaggt tttttttttt tttttttttt tgtaaaacac 1920 

ccacagggta ttgctgttgc ctgggctgga gtgcggtagt gcaatcatag ttcactgcag 1980 

ccttgacctc ctgggctcaa gtgatcctcc tgcctcagcc tcctgagtag ctgggaatac 2040 

aggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttttgta ttttaagtag 2160 

agacagggtt ttcccaatgt tggccaaggc tggtctaaaa ctcccaacct caggtgatcc 2220 

acccacctca gcctcccaaa gtactgggat tacaggcgtg agacaccgtg cccagccagg 2280 

aggcttattt tcttgataaa ttacccagtc tcaggtattt ctctacagcg atgcaagaac 2340 

agcctaatac atccaggctc agcatcagtg gacccaggtg ggagagctta agatgtcaag 2400 

gtctgaatgc cgcttccaca cacctttggg acctagggac tccctctctt tttctttttt 2460 

cagtagaaga tgttatcttc tcctttctct gaccagtagt tggtgatggt ttcagagata 2520

gtttttcagt caagatatat ttcagtggct tcactgagcc caagttccct cgcctctcta 2580 

ggactttatt tccttgtttc tagaagaggg ataacacata ttttctaagg tggttgtgag 2640

attaagggag ctggtaccgg gtggtgcata aggacaggat agagcaatgg tgagaccact 2700 

caaaaagcga aaagttgacc tgcgagggtg acacttatca aatcagcaca cagtgggagt 2760 

ggaaggaatg tccctcatca gttacaatat ttggagagtg caagttatag aaaacccagc 2820 

cctggccggg cgcggtgggt catgcctata atcccagcac tttgggaggc tgaggcaggt 2880 

ggatcacgag gtcaggagtt caagaccagc ctgaccaacg tggtgaaacc ccacctctac 2940 

taaaaataca aaattagctg ggcgtggtgg tgtgtgcctg taatctgagc tactcaggag 3000 

gctgaggcac gagaatcact tgaaaccggg aggtggagtt tgcagtgagc cgagatcgca 3060 

ccactgcact ccagcctggg caacagagcg agactccatc tcaaacgaaa aaaaaaaaag 3120 

aaagaaaacc cagctctaac tggcttaaac agtaagaaga tctattatat tatccatctc 3180 

aggcagcagc aagcccagag gtaggggact ccaaggttgg ttgatccagg gcttaacgat 3240 

gtcatcaaag acccaggttc tttctgtctc ggcacctctg tctgcagggc cagcttcatc 3300 

ctaagccaga ttgttcttgt cttgattaca agttggctgc tgggccagca gacgctgcct 3360 

gcctccctgt tcatcttcag aagtagaaag tggcccttcc ccagtcatgg aatgaaagag 3420 

tttcctttct gtctgggatt gcttaggtcc acccacctga agccaatgac tgtcaccagg 3480 

aaggtaatat acactgattg tcttaagtca gggttcctga gccagtcttg ggcaaggagt 3540 

gtgatactgt catgattgtc ttgggctcat cagggcagct ctgcagatga gatcaaactc 3600 

caagctacat tattctgaac agtgggaagt aggaaagaga cattttggga gatacaaaac 3660 

acaatgtcta tcccatatcc ctaggtccag gtcacagtgt cttggttgga catcaaatgt 3720 

agaaaaagaa agactgtcca tccatttatc tacctattca tctggttttt gatttttttt 3780 

aaattttatt ttaagacatt ctcactctgt cacccagact ggagtgcagt ggtttgatca 3840 

tggctcatgg cagcctcaac ctcccaggct caagtgaccc tcccatgctc aagtgatcct 3900 

cctacctcag cctcccaagt agctagaact aaaggtgcat gccaccacgc tcagttaatt 3960 

tttgcatttt ttgtagagat ggggtttcgt catgatgccc atgctagtct ggaattcctg 4020 

aactcaagca atatgcctgc ctttgcctcc caaaatgctg ggattgtagg catgagccac 4080 

tgctcctggc tcatctgttt aataatttat gaaacaacta ctgggtgctg agcacggggc 4140 

caggggctgg agatctagca gggaccaggc agatctctgc caagtcgttg gtttcttaaa 4200 

ggttttgctc ataattcccc ttttcttttc tctttcgttt tttttctttt ctttctttct 4260 

ttctttcttt tttttttttt gagacagagt ctcactctgt tacccaggct ggagtgcagt 4320 

ggtgcgatct cagctcactg caacctctgc ctcctgggtt caagcgattc tcctgcctca 4380 

gcctcccgag tagctgggac tacaggcgcc tgccaccatg cccggctaat ttttgtgttt 4440 

ttagtagaga ctgggtttca ccatattggc caggctggtc ttgaactcct gaccttgtga 4500 

tccgcccgct tcggcctccc acagtgctgg gattacaggc gtgagccacg gcgcccagcc 4560 

agtttccctt ttcaatgagg cctccctgac ctccatactc tactcctcca cctggcccac 4620 

tcagctctac tttttcttcc ccatagcact caagacctcc taacatacta cgtaagttat 4680 

ttatttacta ggcttactgt gtattgtctg tcttcctcta ctagaatgta aactccatga 4740 






gaatagaaat ttttgccttt ttatttagtg tggtgtctgc agcccctggc ttagtccctg 4800 

gcatacaaca gtcactccac ccacagttgc tgaataagtg actaaaggtc cctgccctca 4860 

tattgttatg agggagtgtg catgttgtta gagaaaaatc tgaggcacaa taaaatttta 4920 

tagagtttaa gttttctttt ttaagcaatc cacgaattgg ggtagtttca gaggtagttt 4980

ttcagtcatg acgtatttca atggcttcac tgagcccaag ttctttcacc tctctaggac 5040 

tttatttcct tatttctaga acggggataa cacatagttc ataaggcagt tatgagagta 5100 

agggagctgg tatggggtga tgcataagga caggatagag cagtggtgag accgctcaga 5160 

tgacaaagcg tcagagacca gtatttacga cggaaatgtg gaagcatgat aaagaaatta 5220 

tttgggctgg gcacaatgac tcacaactaa taaaactttg ggaggccaag gtgggaggat 5280 

cacttgactt gcagaaggtc aaggctgcag tgagctgtga ttttgccact gcactccagc 5340 

ctggtcaaca gagtgagacc ctggctcgaa acgttatttg attggttaca gttatacagt 5400 

tgccttattt ggtctattcc atttgaaagt tcctagttct ataattttaa gtttgttggc 5460 

tgtttctgat tggttaagct taagttttgt tttcctttaa tacagttaag tgccccataa 5520 

tgacattttg gtcaaggaca gaccacatat acagtggtgg tcccataaga ttataatgga 5580 

gctgaaacat tcctattgtc tatggcgtag tggtcctgat gttgtagcgc aatgcattag 5640 

ttatatgttt gtggcaatgc tggtgtaaac acacctactg cactgccagt gatataaaag 5700 

aatagcacat acagttatat atagtacata atatctgata atgataatac ataactatat 5760 

tactggttta tatatttact atattattta tctttatttt atttttgaga cagagtctca 5820 

ttctgtcacc caggctggag tgcagtggcg cgatcttggc tcaccgcaac ctccgcttcc 5880 

tgggttcaag tgattctcct gcctcagtct cctgagtagc tgggattaca ggtgtgcacc 5940 

atgacaccct gctaatatgt tttgtatttt tagtagagat ggggtttcac catgttggcc 6000 

aggctggtct tgaactactg acctcaagtg atcaccccgc ctcggcttcc caaagtgctg 6060 

ggattacagg cgtgagccac cacgcatggc ctatttataa ttattttaga gtgtacgcct 6120 

tatacttata aaaaaaagct aactgtcaaa cagcctcggg caggtccttc aacagatatt 6180 

ccagaagaca ttgttatcat aggagatgac agctccgtgc atattattgt ccctgaaaac 6240 

cttctagtgt ggaagtggaa gacagtgata ttgatgatag gacccagtgt aggcctaggc 6300 

taatgtgtgt gtttgtgtct ttgcttttaa caagaaagtt taaaaagtta aaataaaata 6360 

caaaaatttt taaatagaaa aaagctgccc aggaacaatg gctcacacct gtaatcccac 6420 

cattcgggga ggccaaggtg ggtggattgc ttgagctcag gagttcaaga ccagcctggg 6480 

caacatggtg aaaccccatc tctacaaaaa atacaaaaat tagccgggtg tggtggcatg 6540 

cggctatagt tccagctaat cgaggggctg aggtgggagg atcactgggg gggaggtggt 6600 

tgaggctgna gtgagctgtg attgnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6720 

nnnnatattc ttaaaaaaat ttttttttat ttttgagaca gaatttctct cttgttgccc 6780 

aggctggagt gcaatggcgc tatctcagct cagggcaacc tccacctcct gggttcaagc 6840 

gattctcctg ccttagcctc ccaggtacag gcgcccgcca ccatgctcgg ctaatttttg 6900 

tatttttagt agagatgggg tttcaccatg ttgtccaggc tggtcttgaa atcctgcctc 6960 

aggtgatcca cccccctcgg cctcccaaag tgctggaatt tacaggcgtg agccactgtg 7020 

cctggcctcc tttacatttt tttaaattta attttaattt tttaattttt aatttctcat 7080 

atatatatat ttttaagact agccaagtga agcagtggga gtggaaaagg aactggtttt 7140 

gatcaatagg tgtaaacacc actgcactgg gaccagccta ttttacattc ctgttagcag 7200 

tgatgagggt tcactttctt tgtagcctca acaatatgtg tcgttgccca tctttttttt 7260 

tttttttttt tttttttttg agatggagtc tcactctgtt gcctaggctg gaatgcaatg 7320 

gcatgatctc agctcactgc aacctccgcc tcccaggttc aagtgattct tgtgtctcag 7380 

cctcctgagt agatgggatt acaggcgtcc accaccacgc ccggctaatt ttttgtattt 7440 

tcagtagaga tggggtttca ccatgttggc caggttggtt tcgaactcct gacctcaagt 7500 

gatccgccca cctcggcctc ccaaagtgct gggattacag gcatgagcca ccgcgcccgg 7560 

cctgcccatc ttttttttgt tatagccatc ctagtggatg taaagttttt ttgtgatttt 7620 

gatttgtgtt tccctactga tcaatgatgt tgagcatctt ttcctgtgct tattggcttt 7680 

tggtatatct ttggagaaag gtctattcag gtcctttgcc cactttaaaa ttaggttatc 7740 

tttctattac tgagatgtaa gagttcttta tgttctagat ataagtctcc tacatatgat 7800 

ttgtaaaaat tttccttcca ttattgggtt gtctttcact ttcttttggt gtcctttagt 7860 

gcacaacagt ttttaatatt gaagtccaat tttctatttt tctcttttgc cacttgtatc 7920 

ttggtgtcat gtttaaggaa ctattgccta atctcaggtc acaaagattt acacctgtgt 7980 

ttccttcttt ccttccttcc ttccttcctt ccttctttcc ctccctccct ctctccctcc 8040

ctccctctct ccctccctcc ctccttccct tcctccctcc ctccctcctt ccttccttcc 8100 

ttccttcctt ccttccttcc ttccttcctt ccttcctttg tccttctgac ggaatcttgc 8160 

tctgtcaccc aggctggagt gtagtggcac gatcttggct cactgcaacc tctgcctcct 8220 

gggttcaagc aattctcctg cctcagcctc ctgagtagct gggactacag gcacacacca 8280 

ccatgcccag ctaatttttg tatttttagt agagacgggg tttcaccaca ttggccagga 8340 

tggtttcgat ctcctgacct cgtgatccac ccgccttggc ctcccaaagt gctgggattg 8400 

caggtgtgag ccaccatgcc cggcctgtgt tttcttagag ttttgtagtt ttagctctta 8460 

tagttagatc cttgatccat tttgagttga ttttgtatat agtgtgagat atccacctgg 8520 

tgttgtaaat tgcccagaag tgggtatgct tctaaatctg gctgttaggg attactagag 8580 
gtgaccaaag tgaatttttt ctttgtttct tttttttttt ggagacagag tctccgtcac 8640

ccaggctgga gtgcaatggc ttcatcttgg ctcagtgcaa cctctgcctt ctggtttcaa 8700 

gcagttctcc tgcctcagac tcctgagtag ctggtattac aggcgtgtac caccatgctt 8760 

ggctaatttt tgtattttta gtaaagatgc agtttcacct gttggccagg cttttctgga 8820 

actcccggcc tcaagtgatc catctgcctc tacctcccaa agtgctggga ttacaggtgg 8880 

gagccaccgt gcccagtcct tttctcagaa tttatttgtt tttttttgtt ttgtttcatt 8940 

tttgagatag ggtctcactc tgtcagctag gcaggagttc agtggtgtga tcattgctgc 9000 

agccttgaac ttctggactc acgtgatctt cccacctcag cctcctgagt agctaggatt 9060 

acaggcatgt gcttccacac ctggctaatt ttttaatttt ctaggactta tttgtccatt 9120 

cttgcaaagc agggtacaac atgcctatct ctacctacct ctcttcccfct caagggactc 9180 

cagccaaaat ccttgaggct ctcgggctga ctgtgggtgc tgttgcctga tctgcctcag 9240 

tcatgctgca tgatcaaaag tgtccgtttt ctgcttcttg gaactttatt cactttgggt 9300 

gtcagtcttc ctctgcagtg tcccaagaac acagaattag accaggaatc tgtgttgcca 9360 

tagtgtgtgg aaagaggcag acttccaact ccgctatgtg ctgttgggtg attgaagctt 9420 

aattttcttt ctatctttct ttcttttctt ttcttttttt ttttttggag atggaatctc 9480 

gctctgttgc ccaggctgga gtgcagtggt gcgatctcac ctcactgcaa cctccgcctc 9540 

ccaggttcaa gcgattctcc tgcctcagcc tcctgagtag ctgggattac aggtgcatgc 9600 

caccatgccc ggctaatttg tgtaatttta gtagaaacag tgtttcacca tattggtcag 9660 

gctggtctcg acctcctcac ctcaggtgat ccacccgcct tggcctccca aagtgtcggg 9720 

attacaggcg tgagccaccg tgcctggcac ttaattttct taatacctca attaccccat 9780 

atggtaaaat gggactagta atccatacct tatagcgctg ttgtgaaaat gaaatgaggg 9840 

taagcagata aaatttcaga ctacggatgg gattgttact acattctgaa cctggctttg 9900 

ctgttatttg ctatgtgacc ttatcttctc tggatctcca ttctttccaa gtctataaaa 9960 

caaagtggac aattgtcaac ctttcttcca aagagcaatg atttaaggat caaatgatgt 10020 

catttaacaa aaatatgaag agctcaacaa atgaggaact cattattatt attacaatta 10080 

ttattatttt agaaataggg tcttgttctc ttgcctaggc tggagtccag tggtataaac 10140 

acagctcaat gcatcttcag cctcctggat acaagtgatc ctcatgtctc atccccctaa 10200 

gtagctggga ccacaggcat gtaccaccac gcacggctaa ttttttattt tttattttta 10260 

ttttttgaga cagtcttgct ttgtcgccca gactggagtg cagcagcgca atcaccgctc 10320 

actgcaacct ccgcctcctg ggttcaagtg attctgctgc ctcaacctcc caagtagctg 10380 

ggattacagg cctgtgccac catgcccggc taattttttt gtatttttgg taaagacggg 10440 

gtttcaccat gttgcccagg ctgatctaga acccctggcc tcaagtgatc cccctttctt 10500 

ggcctcctaa agtgctagga ttacaggcgt gagcctctgc acctggcctc ggctaatttt 10560 

ttattttttg tagagacagg ttctcactat gttgccaggg ctggtcttga actcctgggc 10620 

tcaagtgatc ttcccacctc agcctcccaa agtgctgaga ttacagatgt gagccactgt 10680 

gcctggcctg gaactcatta ttgaagcatt cactagtatc aactttgggg ttacctggcc 10740 

acatcctctg acctacctat aagggtatca cagctaacgg agcctctgtt tctcagaatt 10800

taggcagaag cagttcaatt tatcacaaac tactctatat ccagcataag tgcccaaata 10860 

aaacaattgc taaagttctt taggcattta ctgtttgtta gttagatatt tagtcctcac 10920 

tacaaatctg tgatacaggt attattttta ttaaccccat tttatagaag agaaacctga 10980 

agctcagaga tgctaagtaa cttgtgcaag gtcacacagc tagtaaataa agggcagagt 11040 

aaagatttag tttcacattg gactccagaa cctttctact gggactcatg ggaatagtgt 11100 

ggatgtccct gaccttcagt ggcccagggc tctcctgggg gaatccagcc atagacaaga 11160 

caccagcgag agcccaatcc taagattttg tttgtttgtt tttgagacaa ggtctcactc 11220 

tgtcaccaga ctggagtgca gtggcatgat caatgctcac tgcaaccttg atctcccagg 11280 

ctcaagcaat cctcccacct cagcctcctg agtagcttgg actacaggtg cacaccacca 11340 

cacctgacta attttaaaat tttatttaat taattactta ctattatttt ttgagacagg 11400 

gtatcacttt gtcacccaag ctggactgca atggtgtggt ctcagctcat tgcgtcctcc 11460 

acctcccagg ttcaagtgat cctcccacct cagcctctgg agttgcaggg actgcaggtg 11520 

tgcgccacta tgctcagcta atgtttttat tttttgtata gatggggtct cactatgttg 11580 

ccagggctag tctcaaactc ttggactcaa gcgatcctcc tgtcttggcc tcccaaagtg 11640 

ccgggattac aggcataaac caccacaccc aacccctaag gtgtttttgc tgaatgtgac 11700

catgtcagag gcaggaaagg gaagcatcat ggggttagga aaggaacact gagcagggag 11760 

acaaagaaaa tgggatcatt ttgtgagtgt tcgctgtgtg tgtatgtgtg acaattctca 11820 

gagccagcct ctcaggtggt tgagaccaca gtccccattt cccagatgag ataatggagc 11880 

ctcagagagt ttctgcagca cagctagtgg aattagaatt tgaacccggc tcttccagac 11940 

tccaggtgct tcacaaccat cccaaaccta gtcatttgca gtttaccttc atgattttac 12000 

catttccctt tgccatagct agtgttattt acttaataat tccttttgaa tcagtctgct 12060 

taaaaaaaaa tagcttcatt ctaaagtgta atattcttgg aatatcgggt ttgctgttac 12120 

ccacccccac acgttataca tatacatgta tgtttctaat acatatatat gtacgtatat 12180 

acgtgtatcg ttttttgtta ttttttttgt tgttgttagt tttttttaga tggagtctct 12240 

ctctgtagcc caggctggag tgcagtggtg tgatttcggc tcactggaac ctctgcctcc 12300 

tgggttcaag cgattctcct gcctcagcct ctggagtagc tgggattaca ggcacccacc 12360 

actacacccg gctaatgttt gtatttttag tagagacagg gtttcaccat gttggccagg 12420 
tgggtcttga actcctgatc tcaagtgatc cacctgcttt ggcttcccaa agtgctggga 12480

ttataggtgc gagctactgc ggctggccaa tgtatgtttt taatacacat tcaaataacg 12540 

aataactatg aaacctgaaa aactgctcca tgttacttcc tgaacccatc ttgagtgctc 12600 

acatgctgtg cataccacat attgggaaac actgctttcc ctggcttcca agcccagctt 12660 

aatcactgtc ccatcctatg cttcgcttta tttgtctata aatgttgggg ttgggggttg 12720 

atgccaaaga ccttttctgt tgtcattaac atggacacag ctctaagagg tcttggcatc 12780 

ttgggctggc tctcctttta gttcagaatt tggattttta tccaactact cagagtgatc 12840 

aagccttcct tatgaatgaa ctcgttggtc aaactcataa aaggctgatc gataaaacag 12900 

gaatgaatgt atgaattgac actaagtcat tagcatttca cgggaatgga ttctccgtta 12960 

gtggaagagc acatgtcctt tctggcactg atgtgtgctt gggaaactta ctgagctaac 13020 

tggcccatgt aacacagagg ccctttggtg cagtggaaaa ctgttgactt tggagattat 13080 

cttgagtttg aatctgagcc tgcctgtaag aagctggcta actgaattgc tttgcttctt 13140 

ggacccttac catttataaa atggggacca ttgtactcac cctttagggt tattgcatgg 13200 

attaaatggg attctctata gaaaatattg gcacaaagta ggtgtaaatt tgcacgctag 13260 

tgggattgtt tgtgagggaa attgtcattt gattatcaaa gacttaggag caggaacagt 13320 

gtctaattca gggactgcaa atggaaatgc cagctgaggc caggcatttg ctaataattg 13380 

ggtaaagcag ggcaggtgta gaatagcaat gtctgggaat taaaagagag gtgaggacgt 13440 

gtatgacctt gagaaggcaa gccctggcaa aaggggatgg cctccactca gctacagtca 13500 

tgcctagatc ttctaacttt ttatttttat ttttattttt tgagacggag tcttgctctg 13560 

tcacccaggc tggagtgcag tggcgcgatc tcggctcact gcaagctccg cctcccgggt 13620 

tcacgccatt ctcctgcctc agcctcccaa gtagctggga ctacgggcgc ccaccaccat 13680 

gcccggctaa tttttttttt gtatttttag tagagatggg gtttcaccgt gttagccagg 13740 

atggtctcga tctcctgact ttgtgattta ccctccttgg cctcccaaag tgctgggatt 13800 

acaggcttga gccaccgcac ctggccgatc ttctaacttt ttaaagagaa gcaagacatc 13860 

tggattttta tgtgataact cctgatttta aactggcacc caattataat ttacaacact 13920 

ataagggtca acattgccag cagagcaaaa catgggtggg ggcaactgct ggtcaccggt 13980 

gtgcagcctc tggtctaaaa tcatctttgt atttcttctt gctttacgca ttgtcccagc 14040

acagtgctgt tgtatagtaa atatccagta agtgggtgta gaatgaataa accaatgcag 14100 

ataaacctgt agagaggccg ggcacagttg ctcatgtctg taatctcagc acnnnnnnnn 14160

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14220 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntagtccca gcactttggg aggccaaggt 14280 

gggtagatca cctgaggtca ggagttcaag accagcctgg ccaatatagt gaaaccccgt 14340 

ctctacaaaa ataaaaaaat tatctgggca tgattgcagg tgcctctaat cccagctact 14400 

cgggaggctg aggccggaga attgcttgaa cctgggaggc ggaggttgta gtgagccgag 14460 

atcatgccat tgcactccag cctaggtgac ggagcaagat tctgtctcaa aaaaaaaaaa 14520 

aaaaaaaaag aaaaaagaaa agaaaaagaa acaatgaatg agtgtgaggc tcatggtagt 14580 

attggttcct gagagtagcc aaccttattg gtcatcccag ccacgaagtg aaatggtacc 14640 

cctggcttgg gccaatgaat gaggaagaat aatggcaaat gggggtctat gcctccaccc 14700 

tccaccacta gggaggtctc aagcttgaaa tccagtgacc aggtttttag gtcctggacc 14760 

tggccagtcc tcctacagtc aagtagataa gtggagggtt tggtccgttg ggctacggag 14820 

atagtgatca aggccgttac tctgcaatca gactcagaaa tggcctctca gttacttctc 14880 

catttgtggg tcttttggaa gagcagagaa gaggaaggaa tttaggtctt ctcaccctct 14940 

gggctgcctg tccctgctcc ctgagccatg gagggctggg gtggaatatg gggaataaat 15000 

ctgtactttt tttttttttt ttttttgaga cagagtctcg ctccgtcgcc caggctggag 15060 

tgccgtggcg tgatctctgc tcacagcagc atctgcctcc cgggttcaag ttattcttcc 15120 

acctcagtct cctgagtagc tgggattaca ggtgcccacc accacgcccg gctaattttt 15180 

gtatttttag tagagacagg gtttcactgt gttgggcagg ctggtctcaa atacctgacc 15240 

tcaggtgatc cacccgcaca tgcctcccaa agtgctggaa ttacaggcat gagccaccgt 15300 

gcccggtcct accaatctgc acattttaat tgacaagggt caccctccac tcatgtgcca 15360 

ggcatagttc tgagaagcat cccacaagga tgcctctgag ttcaccctga caagtccact 15420 

agctcttggc agagacatct ggcaaattca aggcttgaga catgctggcc tctctttaaa 15480 

gtgcagcaaa ttttgtctag agcttggtca gttaaaattt tgatgttttg ttttgcatta 15540 

atttcaattt ttaagaaatg ttgcattaaa atgttattta tcttgaatag taaatttctt 15600 

agtgtcccct taatttctta gtgtgtctga gttgagagcc tcccctgcct gattctagtc 15660 

cagaccctgg ggtgacagaa gactggtggg agatgggagg tgaggagggg agtgttggtt 15720 

ggagaggatg atctacagag tgctggagag actctgtatg gagcttttca tgctgcctgt 15780 

ttgccagccc tgaagctatg ccttgaggtt gggcaaggtg gcatatccta gatcagagat 15840 

cctcaactgg ggccattttt ctccccagag gacatttgga aacatgtgga gacatttttg 15900 

atcatctgcg ggggtgggga gaggggctac tgacatctgg tgagtagaga ccagagggac 15960 

cattaaactt tctacaacgc ccaggacagc ccctccacaa taaagagtta tttgacctca 16020 

catattaata gcacaaagtt gaggaacctt gatctagatc cacagcacag aagaaaggat 16080 

gtagattttt cacacattaa agatgagaaa gcttgtgcct gtaatccctg tgactcagga 16140 

ggctgtggca ggaggattac ttgagcccag gaattcaggg ttacagtgaa ctatcatcgc 16200 

agcactgcac tccagcctgg gtgacagagc aagattttgt ctcttaaaaa aaaaaaagat 16260 
gaggacaggc acagtggctc atgcctgtaa tcccagcatt ttgggaggcc gaagtgggtg 16320

gatcacgagg tcaggagttc aagaccagcc tggccagcat agtgaaaccc catctctact 16380 

aaaaatacaa aaaattagcc agctacttgg gaggctgagg caggagaagc gcttgaaccc 16440 

gggaggtgga gcttgcagtg agccaaaatc ttgccattgc actccagcct gggcgacaga 16500 

gcaagactcc gtctcaaaaa gaaaaaaaaa aaagatgaga aagaggaagg gagagaaaaa 16560 

agagagagag gaaagaaaga gagaaggttt tggagtcaaa aagacttaga aattccagtt 16620 

cttccacttc ccatggaacc ttggcaagtt gccttctctc tttctctgaa tctcacattt 16680 

tgcctctgtg aagtaggggt ggtacctggt ggagatgatg cggagatgag ggtgaggggt 16740 

gtgttgcaca ctatgcccct aggatgggtg agagcttggg agcactgaac ctccctttcc 16800 

cctcttgttt cttcccccca ttgtctccca ccagctccct gggatctcca cttcactctc 16860 

tgggattcca ccagcaggag gctactcctg gagttaaggc gtgttgttca gactggggca 16920 

ttttaggggg cataaataat aattatgcct ggacaatgga cataacatct agggccttct 16980 

gaagcaaacc agggtgtggg gtacccaaac aaggcagtag gccccaggag gcaggtccct 17040 

gcagtcccag cagagagcag ggcacagggt tgagaagact gagcaaactt cattatcagc 17100 

tcctttgtcc cccactctgt cctggagcaa tcattctggc ctcttcccac ttccccaaaa 17160 

acccagtata aaggctgctt ctggcccctg aagccagagg cactgagagt ggaggtctca 17220 

gactcttgga aggtgagttc ttttctggct gcccaggcag gaccagtgta ggccctggga 17280 

agaagcagca cctcataggg caaacacgta ggaggcctgt ccttaggaac atcatagcta 17340 

agcagacctg tccccgcagg ggcaggagtc tgggctaagg gtgatactgg agagcagcaa 17400 

cggagactgg aagacaaatg aaatttggta cctgagttat ccctcccacc attccttttc 17460 

tagactctcc agctcagggt ctgttcatgg caagaggaga aagcaatctt gtttgctctt 17520 

taatcaaaca attaaacaaa tattccctct atactatgtg ccaggggcta tactagacac 17580 

acaaagacag ccccaagaag gacggtggag tagtgtcctc gctaaaagac agtagatatg 17640 

caatgcctct tgctcctgcc ctttctcctg ctgggaacag tttctgctct tcatctgggt 17700 

aagtctctcc cttccctcct catgcgtctt tccctttttt cctttttcct acactcccct 17760 

ccccccgctt ttatttgcac tcatgaggcc aggaccacag ccttccctct ttagctgata 17820 

cagctcatct ccggtaagat atcacttgga ctcagaactg taacctggaa ctttctcttt 17880

tttgtttgat ttttttttgt tgttgttgtt tttgtttttt tttttgtttg ttttttgttt 17940 

tgttttgaga cggagtctcg ctctgttgcc caggctggag tgcagtggcg cgatctcggc 18000

tcaccacaaa ctccgcctcc cgggttcaag caattcttct gcctcagcct cctgagtagc 18060 

tgggactaca ggcacatgcc accacgcctg gctaatcttt gtatttttag tagagatggg 18120 

gtttcaccat atttgccagg ctggtctcaa actcctaacc ttgtgattcg cccgccccgg 18180 

cctcccaaag tgctgggatt acaggcgtga gccaccgcac ccggcaaact gtaacctgaa 18240 

ctttcagaag gaaaaaccac ccacctgtta agatgaaggg ctggtgactg ccccaggctt 18300 

ctcacacgtg ctttctccca ccttcaaaac acacactcgt ggtgtcggcc agaagtcagg 18360 

ttcttgtcca tttgtgggtg tgacccgaga gatctctcct tacctaacac caaggaaatc 18420 

ctccagtctt gtcttcaggt ggaattccta ggaaagctcg agcgacgttg ctggagctgt 18480 

ccacggtgct ggaactagga agctcttgac ctgatggcag gttacctctt cttcccagag 18540 

aatgatgccc cccatctgga gagcctagag acacaggcag acctaggcca ggatctggat 18600 

agttcaaagg agcaggagag agacttggct ctgacggagg aggtgattca ggcagaggga 18660 

gaggaggtca aggcttctgc ctgtcaagac aactttgagg atgaggaagc catggagtcg 18720 

gacccagctg ccttagacaa ggacttccag tgccccaggg aagaagacat tgttgaagtg 18780 

cagggaagtc caaggtgcaa gatctgccgc tacctattgg tgcggactcc taaaactttt 18840 

gcagaagctc aggtaagtag tagggaggct actgcggagg acctggggga aaagagagta 18900 

cattcagtct tctgttccct attcatttag gctagtggtt ctcaaagcct cgcatgcatc 18960 

agaatcacct ggagttgttg ttaaaacaca gctttctggg cctcacctgc acgacttctg 19020 

atttaggagg gctgaggtga agcctgagaa tttgcattta caacaaatcc ccaggtgatg 19080 

atgatattgt tggtctgggg agaaccaccg atttaaacaa aaggctttgg tgttagaaac 19140 

gcctgtgtta aattctggtt ctgcctttta ttagctgtgt tacctgggca agttgctttg 19200 

cctttcaaag ctttagcacc ttcatttgta aaacgaagat atatagcacc aacttcttag 19260 

agttgtggtg agcattaaat gagataatac atgaaaagtg tttggaatag tcactgggct 19320 

gtaataaact ctcaataagc ggtggttata attattatga gtattatcat ttcctgtagg 19380 

attgtcctga cagctaatta agaagcaaaa gataggatta agggaggcaa gtaggtttat 19440 

ttttaacctg aaaagggatg ccgggctctt gcctggagac tcagaaactt gaaataaatg 19500 

agagggaatt cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 19560

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngaattctct 19620 

gttagcacat agccagaaca tctagaaggg gtggtaggag tggggattag aggttccagc 19680 

tggaggcaat ggcacttgca aaggctttgt tgaagtggcg taagtgtgga ggtggagcat 19740 

tcaggaaagg agagcttcag cttcagtgtg gctggagtgc tgggtgtgaa gagaggtgaa 19800 

gatgaggctt ggaggctggg cagattttgc tccaaaagag cttggtgaac tgtgataagg 19860 

agtttggatt ttctcctact aaggacaaca gcaaactatt gaagagttta aatcgttcag 19920 

tgacaatgac acgtttgcgt tttggtggct cactcgagct gccagccagg tagacagtgg 19980 

cagaagatgg aagataaagc actaaagggt gatgaggcag gaagccagtg aggagagaaa 20040 

ggggacgatg tgagtgacag taaatcattt gttgggttgc tattgtgtgc taagctctgt 20100 
gctaaattct tcacgtgtat tatttcagct aatccatcta acaactctgt aaggcaggta 20160

caatcgttcc cagctgaaga agctgaggct ctcaaaagct agtaacttgc ctaagttcat 20220 

gcagcatgca agttgtccag ccaggattct aacttagaca ccagaggcca cttttaacca 20280 

ctgctctagg actgggggaa atggtcccta gtgagatatg tgtcgagttt catatttcat 20340 

tcaacaatat tgttggcctg ctacatgtga agagctgtgg aaagcgccca aagtgagtta 20400 

gatccctatg agcaagtggg atgggggtgg agtggacagt aggagggctg gaacacacat 20460 

aaaagggtat aagaaataac aattaggccg gccaggggtg gtggctcacg cttttaatcc 20520 

cagcactttg ggaggccgag gagggtggat cacttgaggc caggagtttg agaccagccc 20580 

ggccaacatg gtaaaacccc atctctacta aaaatgcaaa aattagctgg gctggtggtg 20640 

cacgcctgta atcccagcta cttgggaggc tgaggcacga gaatc^cttg aacccaggag 20700 

gcagaggtta cagtgaactg agattgcacc actctactcc agcctgggag acagagtgtg 20760 

accctgtctc aaaaaaagaa aacaaaacaa gtaggtactt tctgccatag ggaggattca 20820 

taaactgcta gtcctcaggt gcatttttgc ttatcagttt taaaaatcag agaatgtctc 20880 

aaagaattag gatgtcagct tcttttgaaa atttgggcca gaagcggtgg ctcacgcctg 20940 

taatcccagc actttgggag gctgaggtgg gtagatcacc cgaggtcagg agttggagac 21000 

cagcctgacc aacatggcga aaccccgtat ctactaaaaa tacaaaaatt agctgggctg 21060 

gtggtgcatg cctttagttc cagctactca ggatgctgag gcatgagaat cacttgaacc 21120 

cgggaggcag gggttacagt gaaatgagat tgcaccactg cactctagcc tgggagacag 21180 

agcaagaccc tgcctcgaaa aaaagaaaaa gaaaatttgg aagatctgac aacagttgac 21240 

ctgcattcct gctcggcaac agcctgatgg tggatgggca gaggctcagt tgtctgccaa 21300 

acctcccatc actgatgtct tccctcgctg tcatcatctg cttgacatgt aggcatttgg 21360 

tgtgtgcctt ctgctctggg tgcccagatg aattggatgc tatatgagaa aacattctgt 21420 

aaatgtcttg tggtaggcaa cctcaaagat cactggggcc tccaatgatc cctccttcct 21480 

ggtattcatg cctgtgtata atcctctccc ttgagtgtgt actacacctg gatacttgct 21540 

tctaataaac agaacacagc aagggtaatg ggatgctact tctaaggtta aattacaaga 21600 

gtgtaaagtc tgtcttgttt gtttccctct cttgatcttc ctctcattct ctctctctcc 21660 

ctctctctca ctttcttact gtcttgtcct tccctttgtt tactctgatg aagcaagcta 21720

gcaagcatcc atgttgtgag ctgacctatg aagaggccca tgtggtggta aggaactgag 21780 

ggcagcctct acccagcaag gaactgagtc actcatcata tgggtgagct tggagacaaa 21840 

tccttcccca cttgagcttt cagatgacgg cagccctggc tgatgctttg caggcttgtg 21900 

agagaccctg agacagaaca ctcagctaag ctatacccta tctcctgaga tagagtataa 21960 

tacatgtagt tttaagctac tatgttttgg gataatttgt tactcagcaa tagataacca 22020 

atacatatac catgtacata actgtttcag ttgtctgaga ctatatttag tcattttaca 22080 

cctacatcaa gaatgtgtca ggcaccattc caggtacttg gaatacatca attaacagaa 22140 

taggtaaaga ggccaggcat agggctcaca tctataatcc cagcactttg ggaggcccag 22200 

gtgggaggac tgcttgagcc caggagttga gaccagcctg ggtaaaatag tgagacactg 22260 

tctcaactaa aaaaaaaaaa aattagttgg gcacagtggc acatgcctgt ggtgccagct 22320 

gctcaggagg ctgaggtggg aagatcgctt gagcccagga gtttgaagct ccagtgagcc 22380 

acggtcacaa aactgcactc tagcctgagc aacagaaaaa gaccctgtct caattaaaaa 22440 

aaaaaaaaaa aaaaggaaag aaagaaaaaa ataggtaaag atccttgatt cttgccctct 22500 

tggaacttct attctagagg gggatggttt ttcacagtag aagtctgtgt tgacagcgct 22560 

gtttaaagct ccttcagcat ctggggaaaa ggttnnnnnn nnnnnnnnnn nnnnnnnnnn 22620

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 22680

nnnnnnnnnn nnnnattttt tagagatagg gtcttgctat gttgcccacc aggctggtct 22740 

tgaactcctg ggctcaagca atcctcctgc ctcagcctcc tgagtagctg ggaatacagg 22800 

tgtgcaccac catgcctggc ttatttcata tatatatatt tttatatata tgtatattta 22860 

tatatataaa tatatatata atttctgtat ataaataaat aaatatatat atatatatat 22920 

ttttagagat agggtcttgc tatgttgacc accaggtctt gaactcctgg gctcaagtga 22980 

tcctcctacc tctgcctttc aaagtgttgg gattacaggc gtgagccatg gcacctaact 23040 

gagttatttt taccacacga agcataggac atacatccaa aaatgttctg agctgagcaa 23100 

gagcctggag gcaagtgaat ctgaactttc ccgtctttga agaaaccagt ctctctccaa 23160 

agtcacatag ttagtgtcac tccccccaag aactgcatga gctgggacaa tcagagggca 23220 

gtggaaggtc tggggctcag gggcgccccc tgctgtctcc ccagggtctg tccccttacg 23280 

caagagcctc tgctccccca ctttcctgtg gagcctcctc accatgggca tgacccagct 23340 

gcggatcatc ttctacatgg ctgctgtgaa caagatgctg gagtaccttg tgactggtgg 23400 

ccaggagcat ggtgaggcac cgctgaggcc cctgggggtt gggggcacag gcgggtcacc 23460 

ctggctgagc tcccctcacc atacgtttcc ctacccacag agacaaatga acagcaacaa 23520 

aaggtggcag agacaggtag ggctatgaaa gcagggccct ggctcacgcc caccccactg 23580 

caacccgctt ctcagggggc gggactcctc taggcctggg cccacccagg taaccctttt 23640 

gtgggatgta agagtctggg ttcagaggaa ggctattttg gtgctctctg gcctccgctg 23700 

gaaggggtga tagtgtccac tgagtgccag ttcctgaccc cactgccctt cccatcctgc 23760 

ccagttgggt tctactcctc cgtcttcggg gccatgcagc tgttgtgcct tctcacctgc 23820 

cccctcattg gctacatcat ggactggcgg atcaaggact gcgtggacgc cccaactcag 23880 

ggcactgtcc tcggagatgc caggtgacct gcctgtacag ggatggtgac agcaagtggt 23940 
caggcagtgc ttttcatttt ctctgtgcgt ttacatccag cagcttgttg ctttctccca 24000

agaaccctag gagatcaggg gtacctcccc attttacaga tgaggaaact gaggctagga 24060 

agggacctgg cttgcttaat aataagaata gctaatgcag agtgctgact gtgcacttgg 24120 

caccttgcct tgtttagtcc tacaacacct ctttgaggta gatgcgttaa tatcttcatt 24180 

ttgcagttga ggaaaccgag gtacagggtt gcacagttag gtcattcacc caagatcaca 24240 

cagctttcag tggcagcctc cagaacctgt gttataaggg tacacgctaa agtcttgtta 24300 

gggctagaat aggtagagtt ggtatattag atatttattg ctgtataaca aatcacccca 24360 

aggcttggca ttttaaaaca acaaacactt ctcatctcat acagtttctg acagtcagaa 24420 

atcagggaga gactcagccg gctgattctg agtcacagtc tctcatgaag acatagtcag 24480 

gctgtcagcc agggctgcag tcatctgaag ggctgactgg ggttggagaa tctatgtcag 24540 

ttcaattacc cccatggcct ctccataggg ctgctcagga cacagcacct gctttccctt 24600 

gagcaagagg gctaagcgac agagaccccg tatcttctct cacataatct cagacgtagc 24660 

ataccatcac ttctgttacg ttctattata ggcacagagc aaccctgata tactgtggaa 24720 

ggagactgga caaagcaggg gaataccagg aggcaggatc cttgagggct gtcttgttgg 24780 

ctggagacca ccattgaggg tttttttttt tttttttatt gagacagtct tgctctgtcg 24840 

cccaggctgg agtgcagtgg cacgatctca gctcactgca acctctgcct cccaggttca 24900 

agcgattctc ctgcctcagc ctcccgagta gctgggattc accatggagt cttgaaccca 24960 

gattctgtga ctgcttttgc tctttttgtg ttcatccaaa cagtccctgt ttatcctaag 25020 

aggatgggag aaagagactg ggagagaagg aaatccagtg gcctccctcc ctgctagcag 25080 

agcctggccc tggcactgag ccttcctcct ctaccctctg ctcctaatgg tgagggtccc 25140 

ctagcagggc ccttctgtcc aggacacatg ggccgcctgt cctcacccca gcctactgac 25200 

ctctctcctg ggctggcctc agtgcccttg attgtgccgg agagaggaag cgctggacag 25260 

tcaggccaag ctgctgtccc caggagggca tctgcttatg tctagggcag ggacaccttc 25320 

ctgaggactt ctgatgagag acggtgtgag agcttcccac ttcccacctt ccttcccatc 25380 

cttggttctc aaaccttcaa gtgtgcatga gaatcactta gtgggggata tttgtccaaa 25440 

tgcagatttg cagatatccc cgctgagatt ctgagggccg agatgaggcc tgtgaatctg 25500 

catgttaaga aagcacccgc tttgatgcgt gtgtcattgg gtaggggagc aacactttga 25560

gaaacatgga gctagagaac gtgggtttct atgggtttcc catagaaaca tggatttctg 25620 

tgttttctgc tgccctgaca tcgaaggcac atctgaaggg ggaggggcca ggccaagaac 25680 

cagggagtcc tgggaacgta gaggcagcag ccagtgactt cccgtactcc tcagggacgg 25740 

ggttgctacc aaatccatca gaccacgcta ctgcaagatc caaaagctca ccaatgccat 25800 

cagtgccttc accctgacca acctgctgct tgtgggtttt ggcatcacct gtctcatcaa 25860 

caacttacac ctccaggtac ccaccttcat ccttcccctc tccctgcctc ccgaggctcc 25920 

tccaaaggga tggtccatcc agcacctgcc ttccaggaag cgcagttctg gtcttctgat 25980 

ctggatctat tttccgggtt ctccaggaag tgtttctagt agattgggtt ggcgaggggg 26040 

tgggaattga ggcccagttg gcctcttcgc cctacccctc cttcctccag cctccacaca 26100 

ctctcctaac ctcttcactc tctctttttg gttttagttt gtgacctttg tcctgcacac 26160 

cattgttcga ggtttcttcc actcagcctg tgggagtctc tatgctgcag tgtgagtctg 26220 

ttgggctgaa atgccttcct gagctttgca accgtgatca gagaacccca gggaagggtt 26280 

gggagggccc caggcatccc ctaatgcacc tctctctgag accctctgat ggcagggagc 26340 

tcacttcctt aaaggcagcc tatcctgctg taattgactc cccctgttgg agtcttccct 26400 

tagaggaagc tgaaatacct ggcttgatga cactttggtt ctatgtctgc tgtttgaaac 26460 

ggcccccaga atggcctccc ctccatgccc accctgaaga aatttcccaa gggcagccat 26520 

ttgccttata attttcctct tcatgttgga cagtccccac ttgcatctct ctcctggttt 26580 

cccctgctgg gcgctgctga gggactctcc cctgtgtatg tgatggagta acaggacatt 26640 

acaataatga tgacaaaatg acaaccatta tcaagtgctc cgttggtgca ggcagcaggc 26700 

aggatccttg accatcactc cctgagttca gcctcactgc agcggtctcg gcagagggca 26760 

gctctctttc cttcatctgc tcaagccaga accctggagt ttccttgatg tttctctccc 26820 

tcacactcca tgttcactcc atcctcagta cagccagcag cagcttctac acaccccaaa 26880 

tctgaccctt cttgtcacct ccactgctgc ctctccagtc ctagccacca acatctctag 26940 

cctggattat tgtggcagcc tttagtctcc cacatctgcc ctggccccgc tgtctcagtc 27000 

tatttttaac acaggggctg cagtcacctg tcaggacata agtctcttca catcactctg 27060 

tggtgtcctg tctcatctgt ctcagagtaa aagccaaagg ctttactatg gcctaaaaag 27120 

ccctgcaagc tctggcccca gcacttcact cccctctagc tccccctcct ccattgttca 27180 

ctctgccaca gccacagtgc ttcctagtgc tccggaagtc tcaagtgtgt tccctgcttg 27240 

gcatctttgc atgtactagt ccctgtttct agaacattct tctccagata tctgcaaggt 27300 

gcccaatctt accttctctc cttcttcagg tctttccctg actgtcctct tctcagtgag 27360 

gcctcccttg gctgtcccat gtacaattgc aacctcccta ctgcccgctt ctctgcttgg 27420 

tttttctcag cgtttatcac taacactctg cctatctctt gcttattgtc tgaccgccac 27480 

ctgctccatg ggaatgccac ctcctcgatg gcaggaatct gttgacttgc ttgatcgtgg 27540 

tatctccagc acctagagca gtgcctggca catagtaggt tctcagctaa atgtttgttg 27600 

acagaataca gtggacagtc ctgcgaggtc aatgccatcc ctgttattag tggaggaagt 27660 

ggggctcagg gagtttgagc cacttgccaa tatcacacat acaggaggtg tgagaaccca 27720 

gctcagtggc cctgaagttg gagcatttgc cctcaaggct ggggaccaaa gagcccatgc 27780 
aaagagcccg aacgcttaag caccaccctg cctggccagc ggggnnnnnn nnnnnnnnnn 27840

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 27900

nnnnnnnnnn nnnnnnnnnn nnnncccact gcgcctggcc cattactttt aatggcaaaa 27960

accacaatta cttttgcacc cacataaata gttaccatgg gctgagcatg gtggctcagg 28020 

cctgcaatcc cagcactttg ggaggctgag ccaggcggat cacttgaggc caggagttca 28080 

agaccagcct ggccaacatg gtgaaacccc gtctccacta aaaaatacaa aaattagctg 28140 

ggtgtggtgg cgggtgcctg taatcccagc tattcaggag gcagaggttg cagttcactg 28200

aaatcatgcc actgcactcc agcctgggcg acagaatgag actctgtctc aaaaataaat 28260 

aaataaataa ataaatattt accatgtttt gaccacctgt tatgtgccaa ctgtattact 28320 

taaaaacacc catgggaggc tgggcacagt ggctcacgcc tgtaatcgga cactttggaa 28380 

gggcaagcgg ggaggatccc ttaaggccag gagttcaaaa ccagcctagg taacacagta 28440 

agccctgtct ctacaaaaaa taaaaaaatt aactgggcat ggtggtgtgt gcctgtaacc 28500 

ccagctcctc gggaggcaga gggagaggtt cgcttgagcc cagcagtttt aggttgcagt 28560 

gagccaggac caagacacta cactccagcc tgagtgacag agcaagacac tgcctctaaa 28620 

caaacaaaca aacaaaagcg acctgtgggt aggtaggaac aggctcatag tacagatgag 28680 

aaagcagagc ttggagggct caagcgattt gccaagcaga ggtccaagcc gaggtctctc 28740 

tgaatccaaa gttaattccg tctatcatat caccacagcc ctctctgccc cagggagagt 28800 

ctctgcccac tccagccact cacgtgtaat tgacttcctc aggggcagga aaggcttcga 28860 

tgggccagtt gagggtgcag ttcagaaaga taaggcaggc caggccagac caggtgaaca 28920 

tgatgaccac gaaggccaca ccggcatcgt agatcagctg tgagaggagg gggcaggccc 28980 

gtgggggaga ctgcctggcc ccagacccca ccaaggtaga tcccaggcct cagaggcctt 29040 

aaagaagttc tcttctcccc ttgtccttgt gcccaatttg cagatgagga aaccaagacc 29100 

agaagtttag agtcagactc agaagaccca tcattccttt ttctttttca cttgaggccc 29160 

cctagagagc tatgaaatag tctccacaaa gcctgaagtt gctggccact ggctcaaaat 29220 

atctctgaaa tttccattat cttaaaaaaa tacatacatt tttgcctatg actccacaaa 29280 

cattcatgtt catgttcgca caaaaatgtc catttcatag tacgtacaaa ggaaacttag 29340 

tgctctaggt ttaccgggcc taatcgtgtt tatcctgccc cttcctggca cattccccag 29400 

gggaaaaggc aaacccagac tgctcatgct cagccttttc tcacctttcc caggtcctcc 29460 

cacgtgcaac aactgggggg gttggggaga gggaggtgca agtgctctgc ccaagggctc 29520

tcaaccccag ggcaggtaag ttctcaattg aatgagattc tgtgcaaatg tgtcagccct 29580 

tcttatggaa gaagctgatg caccatctgt cctcttgtcc tccccatacc atctgaccag 29640 

gataattaat gtctgctctc ccctcaggct cctgctcaaa cctttttctc tgcagtcttg 29700 

gaccttggtg ccttttcctc cctaggggca ggacagagct tcaaagggcc acacccccaa 29760 

atgtgtggag gtaagatctg gctcttcaaa cactacttca gttgaaaaga agggagaact 29820 

gcccaccctc catgcctgcc caccagaaca actgatggcc cccccaccca tgcgctctct 29880 

caaactcctt tggagacact gagcaaaagt accttcttta gtactctttg taaagtgcaa 29940 

aacggtatgc agtttggtac tgcccaccgt ggaggttgag gagcatggca tggctcaaag 30000 

ggtcctttga tatttgacag aggaaattga ggcccccatc ttgcactgag ctaaaacttt 30060 

ggtcccctgg cttcgaggta caccaggttg acctgtccag gatccagcct ggcataaact 30120 

cactttgtga ccttggacca aaccacccat cctctctgga aggtgtggaa aaatgtggcc 30180 

ccaaaggctg aataaagcca gagagtcagg gaccttgaac gcatgtgaag gggctggact 30240 

tgattctgta ggtgaagcta aaccactgaa ggtttttcag cagtgtgtga gccagttccc 30300 

catctgagat ctttctggaa gtcacgtgag tgacagagta cagagaaaaa gaatcagagg 30360 

cagggagacc agctgagaaa gcttgctgtg gcccaggaga gagggggaag gcctgcattg 30420 

ggatgatgac agagaaagga gagcggagaa gtcagacccg tgggtcagca ctagctgctg 30480 

ctcactcggc cccacccggt tcttgtgtca agacaaaaag aaaacccagg tggcctcata 30540 

ccttgattcc tgggaacgta atggcagaag aggcgtaaga gccaatcatg agggccatta 30600 

acgtggagcg caggttccca aacatgttgg gcagctgagg agggaaagca gcacccatga 30660 

ggtggggaca ccgtgaccct tgcccagcat tcccagccct gctccataca atagctccag 30720 

gagacgcagc agaaaagccc caaggtaaaa caaacagaaa aatcaatgtg ggaaactgta 30780 

ctctgccccc tgcctacaca gtcacagtgc cctttagctt caaaaaggct cccagacacc 30840 

cctcagagag acattttgtt aattttgttt aattccaggt ttcccaagtt tgttacgtaa 30900 

cacctctgaa aaacacatgg aataggtgct taagaaacac tgatcttggc tgggcgcagt 30960 

ggctcatgcc tgtaatccca gcactttggg aagccgaagc tggtgggaag cttgaggtca 31020 

ggagttcaag accagcctgg acaacatggt gaaaccccat ctccaccaaa aatacaaaaa 31080 

ttagctaggc atggtggcat gcgcctgtaa tcccacctac tccagaggct gaggcaggag 31140

aatcgcttga acctgggagg tggaggttgc agtgagccga gatcgcacca ctgcacttta 31200

gcctgggtga cagagcgaga ctatgtcccc accccccaaa aaaaaagaaa agaaaagaaa 31260

gaaacagtga tcttgtccaa cccatttgag atgagacaat tgagacccag ggaggaaaag 31320 

tgtactcaag ttcacagagc acattaatgg ctttctcccc attgtcgttg tcccagccct 31380 

aacccaaggc tgtgaccatg gctgtgtccc ggtaataggc agtgcctctt aaccctctcg 31440 

gttgacgtcc cagcccagtt tctgcctaat caggacaaat cacatcctgg gaggtgaggg 31500

tggaaataag ggagggaact gagccagggc agacagtctc cagaggaggt ggctctgacg 31560

cagagcaggg tcagaaccca caccaggaga gaatttaatt gatcatgtgt tccactcacc 31620

tgcctcagcc aagccctcag ggcaggggaa ggcaaagtca ggatgccctt cgcacacacc 31680

ctcctctggc cccaccatcc tccccaagtc actagatccc acagctgaga aggaccttag 31740

gatccgtaca aagcctaaac acactccaca gagggggaaa ctgagactct gaagggaggc 31800

ctcaacagct ctggtaaaaa aggcgtttag gccgggcgca gtggctcaca cctgtaatcc 31860

cagcactttg ggaggccgag gcgggtggat tgcctgagct caggagttcg cgaccagcct 31920

gagcaacacg gtgaaacccc gtctccacta aaatacgaaa aaattagccg ggcgtggagg 31980

cgtgcacctg tagtcccagc tactcgggag gctgaggcag gagaattgct tgaacctagg 32040

aggcagaggt tgcagtgagc cgagatcgcg ccactgcact ccagcctggg cgacactgcg 32100

agactccgtc tcaacaacaa aaaaaaaaaa atggtgttta aacacatata actaaattat 32160

ccttccccct tcccctgaag tggctggctc aggaaaaacc tctacccact caggcagagg 32220

ttttcctgca ccctgcatcc gtgaggcacc actgccaagg acgccaggga aggctgccag 32280

gcctggagag gggcagggcc ccctcccctc caaggggcca caaacgctgt ctgcgcccag 32340

taccgtgggt aaggcgaggc cggccggcta accccgggct ggcggccttg cagcgtgcgt 32400

ggcaacagca gctgggcccg caagactcag cacgggacgt cctcgtccaa gtctgggcca 32460

agagcagcgg cccagggggc ggggccggcc agagggagcg gggagaggct gaggggcggt 32520

gccagcgccg gaccctgcca ttggctggag attacaggag gcggggacat agcagggagg 32580

agccgctgga caagccccac ccggccgcca gggagggtct gaggtcaaga gccggagaga 32640

agggatttag ggccctgggc caagttgcac agcagggaga aggggctgcg cagaggggcg 32700

gggagaaagg gatccgcttc cttcctttag agctgtgaaa tgtccccggt tggaattaaa 32760

ggcggctgct ggggagaggt gaaattcagc caaaaccacc cagtcaggca gcccttctca 32820

gagataaaca gtccgagcca gcccggccag gaaccttccc ctccaacctc cctaagcctt 32880

taacactcct aagcctttaa cgcgtttaca cactcacata aataaacaca ctttgagcaa 32940

cacacataca ccactcacca catgtaatag gtcaagccat gtgcacgacg aggtgtcgac 33000

aatttcatat ggttcaacct agtacactca caaacacacc taccaactca tggctttcac 33060

agggacgggg tcacacaccc actctcccac gacatggcaa gcgtgcacac gctatctcaa 33120

gctgctccct ccccctcaag atcatgttac ccagttttat tttcttccca gcacctatga 33180

cgactgacat aatttattag tttacttgtt tattgggtta tctgtgcccc tcacccccaa 33240

aatgtaacct ccagcaggga ggatgactcg gtcagtcctg attgtgctgt agtccaggac 33300

ctagaacaga gctccatgga cattcatggg ctctgtacac acaaacacac acattaacat 33360

acaccccgac acacagcctc atccacacac acacagcctc acacctgctc tttgcagcca 33420

cctgcacagt ttctcacaca ctcacttgat ctagtgatct gcgtccacag gcccctcccc 33480

cagcccactc atactgccct caccccactc actctgccct caccccactc gggggaactc 33540

tgctgccagg ccaggcctgt gacactcacc gtgagtgaag tgaacgttag gcagatgcca 33600

ccaaagccat tcagggacag cgccaggaat atcaacggag acagagctgg aaaggggaaa 33660

gcagcagatg agggcatttg gggagctgtg ggaagccaag ggcgggagct ggggtaaaca 33720

tccgccttca tcccacctat tcttttcttg tggggccaca agaggacaga caactcacct 33780

tccacgtccc gggaggccag ggccatgagg gtgcaggacg cagtgaagca ggcactgtgg 33840

agacacaggg aagggcgagg ggttggcctg tgagcacccc ccctcccctc cccctgcagc 33900

acggtccctg tcctcccgtt ccccatagcc cagccacctc acctgccaac cagccgcacg 33960

ggtcgggggc caaagcggtc catgaggatc cccagtggca gggtggtggc gctgagcacg 34020

aaggaaccaa tggtgaagcc caggttgagc atctcgtcct gctggtcaca gcctggccac 34080

ctgcgctgct catcctgggt ggtgttggtg ctgctctcag ctgaggaggg ggaagggagg 34140

gctcagcaca tgacaccagg aacagctggg cacaggagac agca^cccac agtcaggcgg 34200

cctgctttca aatcccatgc caagtgcctt tgggggtacc ctagagtcac atctcctctg 34260

atggggctgc tccagaaatg gcagccatta gtacctgacc ctgggagagt cttgtgcaca 34320

cacagcctga ggcttcaact agctcaaatg aaatactgga cataaaagta tttactaagt 34380

tgtaatatgc actcagtgtc caagcttagg gggttgtgga cccccaacaa gaagtgcccc 34440

catatctaga ggcaaaggca aaggcagtga gtggtactct aatggctata acaagaattc 34500

attaaaatgg cccggcgtgg tagttcatgc ctgtaatccc accactgtgg gaggctgaga 34560

caggcagatc gcttaagcct acaagtttga gaccagcctg ggcaacatgg taaaacccca 34620

tctctaaata aaaaaaagaa atttagaaag aacactaaaa cttagaggaa gctttcccga 34680

taaatgatag tctgataaaa taatagctaa tacttattga gcacttaact atgctccagg 34740

cactgttata agtcagttaa taaagtatcc cgttccctag gtgatgaagc tgaggcacag 34800

aatgagaacc aggcactgcc ctccagtccc ctctagaagt ccacttggag gacttgtcct 34860

taacggtaaa ctgccaactt ggagttgtga caagttaagg agaaaagcta gtgataggag 34920

acaaagggct gcttcgcttt actcaatgct cannnnnnnn nnnnnnnnnn nnnnnnnnnn 34980

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 35040

nnnnnnnnnn nntccctccc tccctccctc cctcccttcc ttccttcctt ccttccttcc 35100

ttccttcctt ccctccatct tctccacacc tggtatcatc atacagaagc agagaggact 35160

gcacttggtg aaagtttcaa ttctcctgtg tggagaggtg agcactgagg aaggggtggg 35220

ggctgtcaaa ggagacttac ccaatctttc cagcccacca atcccttgcc cagtgtttct 35280

ataaaataag ggccttttgc atctgattta agtaggaagc tgattcctga gcccctcaga 35340

tctgctgaat cagatagcta agggggccct ggaatctgca ttttagcaag cggaggtggg 35400

ttatgaagca ctccaaagtt tgagacacgc ctcaaaggtg gagtggttct gtggggggca 35460

gaaaggaaaa tgcaaagggg gaaggggtca cacttgggga aggtttcaga caataccgag 35520

tggaaagggt gatgccaggt gtggggagta acagatagag gaggcaaagt gagtggagac 35580

caagccagac cggggaggag ggggccacag ccaaggtgag acaggtcagc agccagaaac 35640

cgaagcagac acttgcaggg tgcaccccgc cctctcttcg tggcaatctg agaccgagga 35700

cgtggagacc ctggagagcc cccaaccttg tttctggggg gtgggtcaga gaggaagcct 35760

ctcatccccg ggcaccagcg gccttcccgg gaggctcaac acgcagatac ctggataggc 35820

gtccatcact cccccggcca gagcccacca acgctcctcg aggtccgacc ttgtccctcc 35880

ttctacccca cagtccccag tcctagctca tctgcataaa gctccaatta acatgttttt 35940

cctttgctat ttgcgatccc agaactcgtt ccccaccccg agcccgtttc ccgccgcttc 36000

ctcgcccctg ggagggcggc cccattaacc ctcgcgaccc gggccgctcc tggcggtcct 36060

gaccccgcca ccccgtcccg cggcgggggt ctgggggtga ggggcgcgcc ctggggcaga 36120

ggattgcgcg gcagggtctg ccacagggca gaggccaggg ctctccggga aaaaggcagg 36180

cgcatatatg cccccctttc tgggaaaaga cggggagggg ggcttctcct gggagactcc 36240

aggcttcgaa attcctcgtt ccctatcctc cggcccccgc acccctcctc ctccccgcca 36300

cgcaccctct cccctcccca gccatctgtt ccactccgca gcgccgcgac aaacacggct 36360

ccagctcgct tccgcccctg cccagccccc tccccaagcc ccggggagtg ggggagtgag 36420

cagacgccct tctcctagga ggccggaatt tctgcctcca tctcccaccg gggtccggct 36480

ggccagaggc aagcttcgag accccccacc aaccaccacc accgttgcga gggccggtga 36540

ggctgcagat aacgcttgca aggacgggag tcggggaggg tgtagggcga gtttaaagga 36600

cgggcagagc aagccccggg aagaggcagg ggttttccct cccgggtcgc cgccccccgc 36660

accctcggag ccagccgcag ccacgcagcg ccgcctgccg ggcacaccaa ggacctggcg 36720

cgcacgtggc gcttaccccc acccccgggt ccgctcctgg ctcgcgctca gcctccccag 36780

actattcgca aattgaggat cccggacaca gagtgcagag accccggcaa gcctactgaa 36840

agccagccga acccgctggt gggtgctagc caattctgat tttgtacttt acaaaaacaa 36900

aaaaagtcag tgttggaagt cgggagtctg ggctcagagc agcagggatc tgcgatgtga 36960

ctttgccaag tctccagacc cctgaggaca ggttttccta tctgaaaacg gaggggacag 37020

tctctcttat taacttctca agagaaacaa agacaaaggg agggaaaatg gcttagctgg 37080

aatgctgtct tacagagcca acctttggag gtgggggaga tggccaaggc ctctgaggtc 37140

actcttggcc ccaggagcag ctgagaaccg gaaagaagct tgggacctcc tttctgcaga 37200

gctatccttt ccacagactg ccgaggttcc aaattgagct ccaccaccta acactgtgtg 37260

cccttgggtg tgtgccttaa cctctctggg cttgtttcct acagcgacaa gaaagaatga 37320

caacaccaac ctcttaggct atagtttgga taaaatgaga tagctgtgta gaacagacag 37380

atcctaaacc aatgttagtt ttcccttcat ttggggactt gctctaacct ccagggctta 37440

tgtcccagag gcacaagcag gtgcagggct ggataaataa ggtatgtctt tctgcaggat 37500

ctcttgtcct cactgatggt gtcttctctt gatatagata attttaaagc ttcacgttat 37560

ttatttattt actttaaagc ctcactttaa tgttaaaggt aaatgtaaat atagtataac 37620

aaggaagctc aaaatttgca taaagtttta agataaaata ggagactcca aaaaagtgtt 37680

actttcggca ggccctaggg atgctatggt gggaagtttg agtcatacct tagcattctt 37740

tctaaagcat tctgtcctaa tcctctgtat ggagaaaagc cagcttcctg gatgtacccc 37800

aaatcctggg aagtaggggg caggagctgg actccctcca agcactaagg gcagggcatg 37860

gttgggaaca gggaggtgag ccagacagcc agaggcgaac gggctggcat gccaagcgtc 37920

ctagttaatg cccagctgag cctgggtgaa gaaggatggg ggtgtgggga agacaccccc 37980

caccaaccgc caaagacagg cgcacaccag ccagtctctc acttcccttt ttatttcctc 38040

taagacttgc aagcagcagc accagagagg gaacctgccc tcctggccct ggaaggggcc 38100

gacccccaac ccctaaccca ggacacagct ggcacctcag gcccctttcc ttctgaaagg 38160

agggctgtgt ctctctcaca ttcacacata cacagacaca tgcatgtgtg cacactcatg 38220

gcacatggga cctcaggggt agcctgtttg ccgatccccc caagaggtac caggaggcag 38280

accgctagaa ggagataaga ggcaccctgg tctcctccaa cccaaggagg aagaaagctc 38340

aacccctcta ggatagggac tgtcttcagt caatggagcg ttgacttagg gggcgttttt 38400

gaaggttttt tttcctcctt tttgcagtct ttacaaaaat agaacttctc ttggtattta 38460

taaatctacg gccatggctc tatgtgcatg ttacaggtag aaaagccata tggggcactc 38520

cttttggttg ctcaggcctt gattgcctgt catccaggtc ccttggtctg agaagtctat 38580

gcggtcacct cagagccgct aagcaccttc agtgggccca tcccattggc ggcgtactcc 38640

tgctggagcc gggcacggta atagaagagg taggaaggca acaggaatcc caggagtgag 38700

aatagcagga ggcccagatt cacctttagg gcaaggagag agaaacagag tcaagtaggt 38760

agtcatctgc ccttagcctc ccacagggag gagaaggcgg ccatttttct ccaggtcctg 38820

agccagaata aatacagcta gtacttatta tgtgtagtca ttgttccacc agtatctcac 38880

ttaatgttca gcaattctgc aaagtggctg agatgagact tctcaggtat aacaagtggc 38940

agggcctggt gggtgcccac accatatggc actcactagg taggtatgag gaaggcacag 39000

cactgtagga gtctgggctg gtcaggctgc tcccgaaatg gggccttctg ggctcacccc 39060

tctgaccttt ggagatgtta accaatggga tcccgttcag ggtggcgaga ggaggctctc 39120

agacacagtt caaggaactg ggatgcacag cctggtggac agaaggcttg gaaggcccag 39180

gacacgcggg ctctgactcg gttcacatcc cactctgcat tactcactgt gtgactttgg 39240

gcaaataatg gcaattctta ctgagtgcct ccttckcagg gctgttgtgg cgaagatgta 39300

agttaaaaaa aagtatgcat catgcttagc acatagtgag tgcttggtaa atagaagcag 39360

ttatttcatc acaattcttt gggaggaggg tttacgtgtg ggtggcccca cagggcagat 39420 

gaaagatcag cgtcagggag gcagatgagt tcaatgtaag gaaaagactt actaacagca 39480 

gcagggctgc ctcgtgcagg agtgggtgcc ctaccactga gggtatctaa gctaagaggg 39540 

aagggtcccc tttcaggggt gctggagaca ggatcccaca ctaggtagaa ctggattgga 39600 

ccaatggtgc ctgaacacag gcccaagagt caggactggc cacttcacaa agcacctgga 39660 

gtttactaaa aacagactcc taggaggtca ggcactgtgg ctcacgcctg taaccccagc 39720 

actctgggag gccaaggtga gaagatcatt tgaggccagg agtttaagac tagcctgtgc 39780 

aacatggcaa gaccctgttt atctgtacaa aatttttttt taaaaaatta gccaggtatg 39840 

gtagccatca cctgtggttg cagctactca gaaggctggg gccggaggat cgcttgagcc 39900 

caggaatcag aggctgcagt gagctgtgat tttaccaccg cactccagac tgggcaacag 39960 

aacaagacac cttctctaca aaaaaaaaaa aacaataggg ccgggcgcgg tggctaaggc 40020 

atgtaatccc agcactttgg gaggctgagg agggcagatc acgaggtcgg gagatcgagg 40080 

ccatcctggc tagcacggtg aaaccccgtc tctactaaaa atccaaaaaa aaaaaaaaaa 40140 

ttagctgggc gtggtggtgg gcgcctgtgg tcccagctac ttgagaggct gaggcaggag 40200

aatggcatga acccgggagg cggagcttgc agtgagccga gatcgcacca ctgcactcca 40260 

gcctgggcaa cagaatgaga ctccgtctca aaaaataaaa ataaaaataa ataaataaat 40320 

aaaataacaa taaattaaaa acaaaaacag actcctacgg tcaggctgag atatcctgat 40380 

tcaggggact ggggaatctg tatttttaac actccgtgag gggttctaaa aggcagacaa 40440 

cttggaaacc tgcagattag agacctctga ggtgcctctg gctgagatga gtgagggatg 40500 

gcaccacata caaggcccta cccctgcccc caggagagtg gctcctgctc cccccacacc 40560 

aaccctcgct ctcacccaga agggctctcc tttcaggggt cccaccatcc ccatgaaaag 40620 

tggctgctga agcaaggcga acacagcact ggtgagggac tgcaggcctg tcagcgtccc 40680 

aaaaggggtt ggatgggaac ctgtccccaa aacgggagat caaagggtgg tgggggcctt 40740 

tcagcccagg caagaacttt ttcttttcct tcccaacatg ggnnnnnnnn nnnnnnnnnn 40800

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 40860

nnnnnnnnnn nnnnnnnnnn nncactccag cttgggtgac agagtgaaac cctgtctcaa 40920

aagaaaaaaa aatcttaaag aataaggata taaagaaaga aaatattttt gtgtagctgt 40980 

tcaatgtttg tatttcaagc caagtgttat tacaaaacag tcaaaagttt ttaaaaattt 41040 

aaaagtttat aaagtaaaaa agctaagtaa gctagggtta atttttttat cgaacaaaga 41100 

aaaatatctt tgtataaact tagtgtagtc taagtgtaca ttgtttttat tttatttatt 41160 

ttttattttt ttgaaatgga gtttcactct tgttgcccag gctggagtgc aatggcatga 41220 

tcttggctca cggcaagctc tgtctcctgg gttcaagcga ttctcctgcc tcagcctccc 41280 

aagtagctgg gattataggc acccgccacc atgcatggct agtttctttg catttttttt 41340 

ttgaaatgga attttgctct ttgacccagg ctggagtgca atggtgcaat ctgggctaaa 41400 

tgcaacctcc acctcccagg ttcaagagat tctcctgcct cagcctcctg agtagctggg 41460 

attacaggca tgcaccacca cactcggcta atttttgtat ttttagtaga gacagggttc 41520 

tcaactaaag agaaccatgt tggccaggct ggtctagaat tcctgacctc aggtgatcca 41580 

cccacctcgg cctcccaaag tgctgggatt gcaggcatga gccaccatgc ccagccagta 41640 

tacagtgttt ataaagcctc cagtagtgta cagcaatgtc ctagaccttc acattcactt 41700 

actactcact cactcactca cccagagcaa ctgccagtcc tgcaagctgc atgcatgata 41760 

agtgccctat ataggtgaac cattttttaa tattttatac tatattttta ctgcaccttt 41820 

tctatgatta gctacacaaa tgcttaccat tgtgttacaa ctgcctacag taatcagtac 41880 

agtactatgt atgggtttgt agcctaggct ataccatgtt gcctacgtgt gtagtcgtct 41940 

atactgtcta gtttgtacac tctatcatgt ttgcataaag ataaaatcac ctaatgacac 42000 

atttctctga gtgtattcct gttgttaagc aacacatgta taaacattta caagaaatag 42060 

ctcaaatttt tttttctttt gatacagggt cttgctttgt cacccaggct ggagtgcagt 42120 

ggcgcaatct cggcgcactg cgacatctac ctccccggtt caatcgattc tccggcctta 42180 

gcctcctgag tagttaggac tacaggcacg caccaccacg cctggctaat ttttttgtat 42240 

ttttattaag agatggggtt ttgccatgtt ggctaggctg gtctcgaact cctgacctca 42300 

ggtgatctgc ccgccttggc ctcccaacat gctgggatta caggcatgag ccaccatgcc 42360 

cagccattac gtttttttgg ttgtttaatt tttttttttt taagagacag attctcactc 42420 

tgtcatcaag gctggagtgc aatggcacaa ccatagctca ctgcagcctc caactcctgg 42480 

gctcaaggga ccctcctgcc tcagccttcc cagtaactga gactacaggt gtgagccacc 42540 

atgctcagct aattattttt tatcttttat tttttgtaga gggggggtct ttctatgttg 42600 

ctcaggtttg tctcaaactc ctgggctcaa tcaattctcc tgctttggcc tcccaaaggg 42660 

ctgggattac aggtgtgagc ctgaaaacct tctagtgtgg aagtggaaga taggcccagg 42720 

ccacttatgt tttcaagtta agcaaggttt aggtcactta tgaagcctga ctagttttgt 42780 

ttgcttaagg gatctgcagg cctgacctcg gttttcattt gttttaacag tgtctatgtg 42840 

tatgtgtgtg tttatgtacg tgcatgatgg ggggaaagct cagaaatcaa gtaagccaaa 42900 

cacaaacatg taattataag cagggataaa ttctatgatg aagaagtatg ggccacggga 42960 

gagtacttgt gccagtctgg tgatcaggaa caatgtcctt tgggaagtga catttgagcc 43020 

atgccctgaa gtacggtagg agttggttag gggtgaggca gtaagaccca gagctggggc 43080 

ttcctgcaca agctcagctg ggcactgagg acccagtgga ctctgctaca gggcagtgag 43140

gagcagaaag gctgaggaag gctgggtgtg gtggctcaca cttgtaatcc cagggctttg 43200

agaggctgat gggggaaaat cggtagagct caggagtttg agaccagcct gagcaacata 43260

gcaagactcc atccctgtaa aaagctttta aaaattagct gggtgtggtg gtatgcatct 43320

gcagtctcag ctactcaaga ggctggggta aggattgctt gagcctagga ggtggacgct 43380

gcagtgcgcc acgattgtgc cactgtactc caacctagga gacaaagcga gatcctgtct 43440

caaaactgaa tgaataggct gtgtgcggtg gctcactcct gtaatcccag cacttttgga 43500

ggctgaggtg ggtggatcac ctgtgattgg gagtttgaga ccagcctggc caatatggtg 43560

aaacccgata caaaaattaa ctgggcatgg tggctcacat ctgtaattcc agctactcgg 43620

gaggctgagg catgagaatg tcttgaaccc ggggggcaga gggtgcagtg agctgagatc 43680

gcaccactgc actccagcct gggagacagc gagactccat ctcaaaaaaa aaataataat 43740

aataacaatt aaaaaaaaat taaaaggcca gggagcactg gcagcctgtc caaggtttca 43800

ggtcacttta gtaaagggag aacaatggct cctcccagga cctctgggat ctcagcattg 43860

atacgacagt catggaaatg ctagggccca ggcagaccat ctcagggaaa acaagtggct 43920

ctgccctgcc ttggccactt cctggccctc tgcatgcccc agggtctcag caccaagctg 43980

ttctcagtga gtagctctca tttagtgcca gggctctcgg gcttacatcc tacgatgacg 44040

atggaatgca taaaagatgg ggctgtgata gcccagagct aggggtttga atctcatgag 44100

atgttcatgg agccctggga gggagctcag tgcaagttca tttctctttt ttggttgaga 44160

tggggctcag aggaggaagg acttgttcaa agacacacag ggagtgtttc agtgtgggac 44220

ggaggtttat ggagaaaggg tgaccatcca aggcttggac aaagatcatg acttcgacca 44280

gcaagcctca actctgtaga cttggtgggg gccaggccct cccaaacaca cctgacaggt 44340

gtctgtggtc ttggggacat tgtcgctccc cttcctgctg atgctctgct gtccctctcc 44400

catgaagcgt atctcttcgc cgtcccccat ccttgctgag agaggatggg ttctcttctg 44460

accaatactg aagatcttta gtaaagttct cttttttttc attttctgaa agtccctctc 44520

ttgagaaatc aggacaagtg agtcagggcc aggacaaaaa acagtgtggg acgagtgtgg 44580

tggctcacgc ctgtaatccc agcactttgg gaggccaagg tggcggatca cttgaggtca 44640

tgagtttgag actagcctgg ccaacatggt gaaacctcgt ctctacaaaa tacaaaaatt 44700

agccaggcgt ggtggtgcat gcctgtaatc ccagctattc gggaggctga ggcaggagaa 44760

tcacatgaac ccaggaggcg gaggttgcag cgagctgaaa ttgggccact gcactctggc 44820

ctcttggcaa cagagccaga ctacctctca aaacaaaaac aaaaacaaac gacaaacagt 44880

gtagactttg tgtttttctc aaaagcactg tcaagccagt gcccgcagca gtgggcctag 44940

acacctccag tcttgcctca gggtcagttt ccagcctccc tggacacttc ccccaggtat 45000

gtgtactttt tgattgtcct aaatccagag tctgtggcct gacctggttt gtcacagctc 45060

tcagtccctc cccatcccga atcccaggga gccgcaggtg tgtgcagaag aggcacacca 45120

cactcaatac atcttgcatc ctcgctggac ccaatccatt ggcttggtga tgtacagact 45180

gagcctcatt atagccgttc gttcctgttg acctttccag atcaatctgc cagcttggct 45240

tctccgagtt tcgcttgtca gcatttctcc aatcccatca tgtactttgg acctctttgt 45300

tgggtggctt gctttatctg aaattttcag atttgacttc aggtctctcc tttgtccctt 45360

aatatggctt aatggtggac cctgtcaggg gtagagaaaa tattgaggag ccctgacttt 45420

gaggtgcaca agttagaggg ttagacaagt ccagccacaa ccagcccaag ctgcagtgta 45480

gggaggcctg tccagctgct ccacggttga gggtggagca tacaggaagg cttccttctt 45540

gctgcagccc aggtgttctg gctgccctag ctgcctggct ttggtagaag aaagaaaggc 45600

tctgtctctg acttgtcaac taatggcact atgagattgc acataattaa cctgggtctg 45660

ctcttccaaa agccttgggc ctctgactgc aacatggagt ctgggtatca ctccccatcc 45720

ctgcgccact cacctgctct ggcgctaggc gtgtgcctaa tcacttaatt tctctgtgct 45780

gcctcttagg tatcacttcc cctgatccca aatacttacc aggtgtggga tgacacctga 45840

ctagttactc cttggaggta tctgcttctc accggggact ccgaaaccaa acgaaaagca 45900

aggccaagcc cagcctaaag gacgcttcct acatgacttc aggcttgcgg gggctggagc 45960

gtgggggtgg caatggagtt ggggggggct cagggagggg atgtggaagt gctttgcttt 46020

gcaaactcta gagaaccgtg taaataggag tgattattct gtcccttccc tttctttcca 46080

acaggaatca gcatcccaca gcccatgttc agctatgaag aatggaaact gaggctccgg 46140

gaggggtata gggaggagcc agcagggtct tgagttcata ttagtgccct ttcctccata 46200

ggcacatctg tgttttcttt tattttattt tgaatttaat tttttttttt tttggcagag 46260

tcttgctctg tcgcccaggc tggagtgcag tggcgcggtc tcagttcact gcaatctccg 46320

cctcctgggt tcaagtgatt ctcctgcctc agcctcccga gtagctggga ttacaggtgt 46380

acaccaccac acccagctga tttttgcaat tttagtagag acagggtttc acagtgttgg 46440

ccaggcttgt cttgaaatcc tgacctcaag tgatctgcta gcctcggcct cccaaagtgc 46500

tggtattata ggtgtgagcc actgcgctcg gccacatctg tgttttaaat gagaggaaag 46560

gggataatgt gcattttgtg gaagcttggg ccgtttgtgt ctaggactct tatgatcttc 46620

ataagttttc ccccagggag gacactgttc cacttaggga gtcaggaccc ccagtcctta 46680

caagattcag cctctcaaaa tggagacagc agttccaggc ctgggctggg ttctgttcac 46740

actaggagag ggcaagtgag tggtgtttgg gatgtgggga agtattatga aaacagagat 46800

gctccaattc ctagtgatag gaaaccatta agctacttgg catcttaaaa ccaagagcgg 46860

ttcaagttct gagattgtta acacacctta caacaccgcc gccgttatta ggaagaagct 46920

ctgtttgatg acgtcccaca ctgtgggtac ctttatgaac aggaatttgc tttttcaaat 46980

cccagagaag taagattaaa gttggctgtt ctccatcctt gaaaaatttg gttttagggt 47040

gaattcaaga atgactgacc atacagaatg gggagcaaac ttgggaagaa agaaggcaca 47100

gttcagagct ctcccaatag tcacccctga actgcacccg gaccatcagt tatctctgtg 47160

ggtagagctc aggaatctaa aatccatttt aaaattaaag tatatcgggg ctgggcgcgg 47220

tggctcatgc ctgtaatccc agcactttgg gaggccgagg tgggaggatc acgaggtcag 47280

gagtttgaga ccagcctggc cacatggtga aaccccgtct ctactaacaa tacaaaaatt 47340

agccaggcat ggtggcagac acctgtagtc ccagctattc ggaaggctga gtcagaagaa 47400

ttgcttgaac ctgggaggca gaggttgcag taagccaaga ttgtgccact gcactccagc 47460

ctgggcaaca gagggagact ctgtctcaaa aaaaaaaaaa aaaaaattaa agtatgtcat 47520

acatactgtt acaggcacag accttaagtg tacagcccaa tgaaatttta cacatctata 47580

cagctatata actaccacct atatcaagac acattccagg aactcagact ccatcatacc 47640

cctcctcagc agaggtaaca gacccacacc tctcctgctc cggtggtaat taaccactat 47700

tctaactttt ctatcaatta gttttgccca ttcttgagct tcacacagat atacattgtc 47760

aggcatgatg actcatgcct gtaatctcag cactttggga ggccgagacg ggagtatcac 47820

ttgagcccag gagttggaga ctactctgga caacatagtg agacccccga ctctacaaaa 47880

aaaataaatt agctggtcat ggtggtgcgt gcctgtagtc ttagctattt gagacgctga 47940

gagaggagaa tctcttgagc ctgggaggtt gaggctgaag tgagccgtga ttgcaccact 48000

gcactgcagc ctaggtgaca gagtgagatt ctgcctcaaa aaagaaaaaa tatggccggg 48060

cgcggtggct caagcctgta atcccagcac tttgggaggc caaggcgggc ggatcacgag 48120

gtcaggagat ggagaccatc ctggctaaca cggtgaaacc ctgtctctac taaaaataca 48180

aaaaaagaaa gaaaaaaaaa ttagccaggc atggtggcgg gctcttgtag tcccagttac 48240

ttgggaggct gaggcaagag aatggtgtga acccgggagg cagagcttgc agtgagccga 48300

gatcgcacca ttgcactcca gcctgggcga cagagtaaga ctctgtctca aaaaaaaaaa 48360

ggaaaaagaa aaaatatata tacattgtgt actttttggc atctggttta ttttgctcaa 48420

tatcacatct gcgaaattaa tctacactgt gtgtatgaaa ggttggttct ttttgttgtg 48480

atgcagtatt ccgtcgtgtg actacgggac aatttgctta tccgtattcc tatcggtggg 48540

catttgggct gttaccaggt tctggctgtt atgaataaag ttgctatgga tattcttgta 48600

cactacttct ggtgagcgta tgcactcatt tcgcttatgt aaatatcttg ggtggaatta 48660

cctgatcata aggtaggtgt gttggctttg taatgtgctg acttggttat gctgaattcc 48720

cttttttgtg tatttctggt tagagcggaa catgagggtg tctcttcagg gaatctggag 48780

ggtggaaggg aagcaggagt cggtttctgg ctcacacatg ttgtgactga actgctggta 48840

cacctggttg gcatggagct ggcttctcct ttggcgttgc ctactgttgg ggcaggtgtg 48900

tatgtggtta gctccatgca atgaacccgg gcttctgcaa aatacattaa caacgacaga 48960

gacaacaaaa gctgatgtgg atttaaaggc ttcagttcan nnnnnnnnnn nnnnnnnnnn 49020

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 49080

nnnnnnnnnn nnnnnnnnnc cagcagtggt tctcaactga gcatagtttt gcctcagagg 49140

ggacatttgg taatgtctgc agacattttt tgattgtcac agcccagccg agaaggtact 49200

actagtatct ttttggtaga ggctagagag gctgctaaac atctaacaat gcacaggaca 49260

ggcctctgta acaaaaaagt atccagtcaa aaatgtccac agtgttgaga ggtttaggta 49320

agtaggcgct aaaacataag gagactgtgc ctgagagcaa gaaggagtaa ttggaaagtg 49380

ctggtgtgat tagctctggg ttttagaaag ctcattttgg ctgcttgtag acagtgcatc 49440

agaggtggag gagggtggta agactggagg cagggaaagt aatttgggag ccactgaaat 49500

gatccaggtg aaaaacggtc agcaggtgac taggaaagtg gcagaggcaa tggggatggg 49560

tggctggatg agatggtgaa gaaagcacta taactaacta atgtgtggat gatgggcagg 49620

aggggtgaag gatgaccaga gtcctgcctt gcaggtctag ttggaaggtg atggtttctc 49680

ctgagaaagrt gaccacaaaa agtgaagcag gtttgtgcgt gtgtgtgtgt gtgtgtgtgt 49740

gtgtgtgttg agttcagtct gagatgtgtt ggactcacaa tgtccatggg acatccaagt 49800

ggagaagcat cttgggtgac catatgtgtg agtctgcagc tcagaaacag gcctggggct 49860

ggagatgaag acttgggaat gatctgcgta tatatttggt agcttgagcc acaagagtag 49920

atgacataac ccgtggtggg tgtgcagaat taggagagac gtgcaccaag aagccaggtg 49980

atccccaata tttaaccatc tggaagaata agaggagcct gccaacagaa attgggaggg 50040

aatggccaca aaggctactg agaagggaag cagttcttaa gaagggggaa gtgaagaggt 50100

atcactactg cagaggtcaa gtaggataag aactgaagaa tgtctgttgg gtttggcaat 50160

ggggtagtca gtgggcacct gggcaaaagc agttttggtg gagcaatagg gataacagaa 50220

acaagactgc tatggtaaga ggaggaagag ggtgttgagg aagtggccag cgagtctaca 50280

ccacttgctg gaggagcttg gctttggtgc aaagcagaga agccagctca ctcattgact 50340

taacctccaa gaaacacaaa atcatccata tcctggctca aattccagca ctaccaggag 50400

atggttggcc cctagaaatg ccatcccact tctcctctgc ttatcctatc ctatctgtca 50460

gtctgttgag cccaggctaa gcgctacctc ctcaagcaag ccttctctgc ctgccgtcac 50520

actttaagtg atcctgacaa cactgaaaat gtgtgtctct tccattcatg ttagttctac 50580

acttctgagt atctcctcaa tatattgcct tgttttacta atatgctcgt tctgtttgcc 50640

ttatttatca gctaccttaa acctccctgc aactagagat tctctttaag tatttgttga 50700

ataaatgaat gaatcaatcg atgatccaga gcctggtaga ggcttgtgtc catggtggat 50760

gaggctcaga aaatacctgt agaatcgaaa taaatgcatg tgtgctctga tctaaactca 50820

gctaaacttt ctccaggggg taaagttcaa gttgattagt caattgatta attaattcat 50880

tatgtaatgg aaaaactcct tctatgacct gggcagagtt ataggcagtg aacaagacag 50940 

acaaggtcct tgttgtcatg aagtttgctt tctgaaggag agagataata aacaagaaac 51000 

cagtaagaaa gcaagattat atcattttgg taaatgttct tgtggaaata aatgtgatga 51060 

tgtgtaacaa aagtaccaaa taggagagtg gggtgggtgg gcttctttta gaaagagttc 51120 

tcggagaagg cttatctgag gaggtggcct tttaaccagt acaaatgctt tagcttggcc 51180 

agtggagctg ggaccaggat gacaagggtc acttgtcatg ccagtgagtt tgagcttgta 51240 

gacaagagcc tgatcatgaa agactttgca gatggtggta atgggtttgg gttaattgct 51300 

actatgtggg aagactttga atgggaagca tggggacaat ggcctgtgat acatgttatc 51360 

aaatatggtc gcaggggcta gtgaggtggc agcagagata gggagaagta gacggactgg 51420 

ggaaggtaga agatggggca ggggaggcaa ttactgcaaa gacatattcc ttctaagctc 51480 

actgagtgtt catggtctct gggagcagag gttcctggag gggaaagagg ataatgtcac 51540 

ttcctgagga agcgggaaga acccatctga gacgtgggga ctgtgctggt tcgtttctaa 51600 

ggggccttcc agatctcaca tgccaatcgt cttggtctat gtcaattgtt ggggcatcca 51660 

aatggggaac tgttgtccag gccgatttca cagaacaacc gcccagtcca tatctcccga 51720 

gccattcacc cttgcagtgg cgttagctct ttcaccagct tttatctgcc ccgtggggat 51780 

gttggccaag cccagttaac aagcagttga tcagccccag agatcaggtc cctggagtct 51840 

gtcacttttc tgagggtggg gagagaatcc tggagcagaa catgtaacta gaagggccac 51900 

ctggcttcct atggtctgag ggagagaatg gtgggatctc tggcctgaat caaacctccc 51960 

tttctcagtg tccatcttac ctctctgctg taccttcgtt attttccagc agctcctcag 52020 

cccgttcctg tgggaccctt ctctgccaat ccctacaccc actgtaaatt tcaccgtggg 52080 

agggagatgg gccttgaggg ctgtattagt cttctattct gcataacaaa ttgcctcaaa 52140 

tttagcagct tcaaacaact catgtttatt agctcatcgt gagttcatca gcagtgtggg 52200 

cccagcatgg ctaggttttc tgctcagggt ctcacaaggc taaaatcaag atgttgtctg 52260 

ggctgtgtgc tcatctggag tttagggttc tcttccaggc tcacgtggtt gtggcagaat 52320 

tctgttccct ggagttgcag ggctgaggtc ctgttttctt gctgactgtc agatgagggc 52380 

tgctctcagg tcctcgaggc tgcccacatt gcttgccacg tgcgtggtct tttccatcct 52440

tgaagccagt gatggagaat ttcccttgga ttgaatcacc cacatggttg gactctctga 52500 

cttcaggaag agagccctgt ctcttttatg ggatcacctg attagatcat acccatagag 52560 

ggcagttcct tttccttaaa gtcaactgtg gcatgtaaca tcacacaacc acaggagtaa 52620 

aatccatcat atttacagtc ccagggatta tgcacagtgc accaggggac aactgaattc 52680 

tgcctgtcaa aagggccaag caggacttta ttggtgaaga acagtggaat gtcattcttg 52740 

gttcttccag aaaaaaatca ctcagtaaag ttagaggttc tcttgccttt tgggaagtca 52800 

tcaaagaatc tcatggaggg tttggacctt caccctagaa acatcacacc atgttttcta 52860 

taattgcagg gttcatggtc ccttgaagcc tattcatagt ttccaggttg aaaagctctg 52920 

ctgcagggtg tggggaggga tgcaggtgga ggtgagggct gaatagtgtg agctgcatat 52980 

ctggagctgt ggtggttttt ttagtcttta agctgtcatg tgttgggggt tgggcatggg 53040 

aggggcatcc caagagctcc ttggtattga caccatctcc aaggtgatct ctgctctgcc 53100 

tggtgcacac atgtttttct cctgttgcaa cagcccactc ttgtagaaga gcagacccct 53160 

cagtaccagg tctgaccctg gacagcttgt accaggagct acagcacact cccccacaag 53220 

cctaaagttg ggatgagccc cccgagaatt agatcagaaa agattaaatg cagaggtgat 53280 

ctgtcaggtc ccctttggaa gtgctggtat ggagaggatt gactgagtct gtttaggaac 53340 

ctccaagctc tgtagtaact ttagggctag aaaggaggat gcctaagatt caggatcctg 53400 

cagtgatgag tcaacatttc ttggggaagg aggcagggct gaggattaaa cggagatgat 53460 

gggtatcgtt ctcttgctca aaggcactgg accccaaggc ctccagctct tcgctcccat 53520 

ttgaaattca agtcctgagc acaccacagt tgtgatgcag ggaaagaatg tgcttatcag 53580 

agagcctggg caagtgggcc ccttgtgagt accgttcaac ctcatttatg tcattggcac 53640 

caaaagtaga catcagtctc ttgaaagttt gattaatgct ggtcacactc aaagaccctg 53700 

ggtagcattc atttactaag caattactaa ataccagttt ctgtgctaaa tgctgcatca 53760 

gtcagggctc ttaatggcag gcagcagaaa ctctccttgg ctgatctaag tagaaaaatc 53820 

caggactgaa aggaaacgga gtagctcatg aaattgcagg aagggccgga aaaccagaca 53880 

tggagccaaa gtcaggctgc agaacaggtc tagggaggat cccactgctg ctgagaccta 53940 

gaccttgtgt ctggcaccca ggatgttgta gggctcagac cctggatcaa tgtatcctgc 54000 

agtgcctctg tgggtactgc aactccagga actcaatctt gtcaacgcca ccgccagaga 54060 

gaggccttct tggcctccat ctttttggtc actagctcca gattcaaaat cttgaataga 54120 

tgcttcttct ctttgataga gcccagtcat atgcgttagc tgcaaaggaa gctgaaaatc 54180 

tattaggaac ttttgtcttc aaaaatgaga ggcctgtcct ccaccaagat ccataggaaa 54240 

tggaatccaa gaaaccacag gaaggggtga ggtgactggg cagctcacag catgcatgct 54300 

acatgtgaat tatctcattc atttctcaca ctacccagtg aggtaggtat tgtcatccct 54360 

acttcataaa tgatgatatg aggtacagaa agtttaagga acttgcccag gacacgacac 54420 

gcagctatta agtgctagac ccagtcaatt tgagtctgac ttggactgtc tgactccaga 54480 

agccaccctc tcagacactg ctgtatactt ccagtgaatg ttgatgaaat tttcagggtt 54540 

gctaagctgt ggatttcaga tcctggattg tatgacctaa aagagagact tccctaggag 54600 

tgagggtccc tgaacagtca actggtttcc aagaatgggc tccctctcat caccttatga 54660

cagtaatcct ctgtccaaca gccaaagagg tcctgtgggg agggcttgca gatgggagtg 54720

cgcagagccc agctcaaagc tcctgactag gctcttgttg agtattcctt tgattcctgc 54780 

ttctgtcttt ttaaatcaat ggagacaggg gagggttatc tccatcctcg gctcaagatg 54840 

aaatgcatcg ttcctcgttt ttctcattcc ttcccaatgt gtgtactgtt aactttagtt 54900 

atgaaggaaa ttacagtgtc ctgtgcatat accaaggctg tccaacctcc acacctttgc 54960 

tcaagctgtt ccttctactt gaaatgcctg tttccttccc ttctaattgc atctttccat 55020 

ccaggtagga atcagctcct tggttcatgg agccttttct gctctgtttt actatgcatg 55080 

gacttccttc tgaattagca gaggatgttt cctagcttgg tcttaaccct tctccttttg 55140 

tttgacctca atttactcat cttacaaatt aggttgtaag ctaattgaat acaggatcta 55200 

tgcttcactc tgattttatc tccacctgga tagcatcatt tttgacacac aagcaggcat 55260 

atgggagggg agagaagttt ggtgccagaa agaactggat ttgaattcta accctgttgt 55320 

ttacgtgagt acgttactta accattaatt acttcaatgt atatttatta agtacctact 55380 

atgtgccggg cactgtacta agcaccaagg atacaatggt gagtaaagag atgcagcctt 55440 

caccatcacg aaggaagaca gatgttaatc cattaaccaa gtaatctcac aagaaaagta 55500 

aaatgactaa ctgataagga caagcccctg gagctacaag agggtgtata cagggcatcg 55560 

atccaataag ggcagtgttg cggggagatc aggagccaca cagagcctgg gttgtctcac 55620 

ttggaaaatg gggtatcaac cacctacctc actaggtttt taaaatcagg ttaaatgagg 55680 

taatacttgc catgaacagt attttgttga ttgatgattg attgaaacgg agtctcactc 55740 

tctcgcccaa gctggagtgc agtggtgcaa tctcagctca ctgcaacctc tacttcctgg 55800 

gttcaagtga ttctcctgcc tcagactccc aagtagctgg gattacaggc agccacccct 55860 

atgcctgact aatttttgta tttttagtag agacaaggct ttgcaatgtt gaccaggctg 55920 

gtctcaacct cctgacctca aaagatccac ccacctcagc ctcccaaagt gctgggatca 55980 

caggcatgag ccactgcatc cagccacttg ccatgcatgg catttaaaaa tgttcagtaa 56040 

atgttaccat aatgaaggct ggtaggttgg ccaactgagt ggtctgattc agaaggaaag 56100 

aagttagaca tacgtgaaca tttcctgtac ttgaagatcc tcaggacagt gactcctaga 56160 

cccatcttcc atcacagtca gctgggaagc ttttaaaaaa atgcagacat ctgaccttca 56220 

cgctagacct attagccaag cagaagtttc tgggcagggc atctgcatat ttttaaaaat 56280

ctttaataag gcagcctcaa aattacagat tcagcacgca tttaccataa ccactgaaga 56340

aatgcaaagt tataaaaaga agataaacaa caatctgtct cctgctttct tccctctcct 56400

cccctgcttc tggaggcaac aaggtcaact atttggtgtg attcctttta gcattccctc 56460 

catcaatggt cacataagga tgctcacaga taagcaccta tgcgggggtt ttttttttcc 56520 

ttgtaaaact attcacatac taaatacttt cctcagtatc ttgccttttt tcacttcatg 56580 

tcacagaaac atctcttcag gtttatagat acaggtccag ctcttctttt catagccata 56640 

taacattctg tagaatagag aggacacatt ttactcagtg tccgattgat ggatatcaat 56700 

attgttttca tttctacaaa tagtcaagga ataacataac tctgtaaaag ttttattact 56760 

tataggcgca tttatgccta aaggatagtc tcaaaagagt gaaactgatc aaatgtgcat 56820 

ttttttattt taataggtat ggacagattt gttctcaaaa tgtttgtggc agttcaaaac 56880 

accagtaaaa caggggagat atgtattttg gaaaagcacc caaggcgatt ctgaagtgta 56940 

gcccaggata agaaccattg cccagagctg ttccagatgg cccctgggtt cctgaagtgg 57000 

gtatcgggag agaaatcttc actgaatgaa tgagtgggct ccccagggaa gtgatgaaat 57060 

ggtccttatc agccttgcta tctccctctg acagaggcaa actctctctc cctgggggaa 57120 

gttcctccaa ggcctctata taagaagtct ttgtgagagg aagcaaagaa ggacctgggc 57180 

tttgggaaga tctaaagacc caggaaggtc tctgggtggg tgagtgcttt ctctgctgtg 57240 

gtggagctgg tgacagttta ttctcccagg aggtccctgg ctgtggctga cagtttctgg 57300 

agggctggca ggcgtctacc tgtggctttc aggttatgag gatgtcagca ggggcagcct 57360 

tcatcctctg ccttgcacat tccttctgcg ggatgtgaaa gtgctccttg gctggggaaa 57420 

ggagatggtg gagacatgga ggagggtgtg ggtggcttct tgaactctga ggaggggaca 57480 

taccttctaa gtcctatgtg ttcctaggaa agccaataat cattgcttct cccgcctttt 57540 

ttatgtcata gactctgagg gacccattaa gtacaaacaa ataagcgtaa tagtcccttc 57600 

tttacttccg ggcctgaagg aaagccagcc tcagccaccc ctcagggttt gctgcgttct 57660 

gtttagaaag aggtccttgc gtcctggatc ctggagcatc aggagctggg cttggcatga 57720 

gcttttctgg cccatcctga tttctattca ggccttcttt ttctccacct cactcccacg 57780 

gtcccctaat ggtgtgattg tgatgtgtgt gcatgtgtgt ctgtgtgtgt caatgacaaa 57840 

ctgtgttctc cgttgcagga taaagccaag atgaaactcc ccttacttct ggctcttcta 57900 

tttggggcag tttctgctct tcatctaagt aagtgttttt tgccttcagt ctttctttct 57960

ctgttttttc cctttctatg gtagatgggg tcagagttac acacccaccc ccttctttga 58020 

tcgtcttcta tttctgaatt tctgtgtgct taaagggatg gggactctat ggccaggagt 58080 

tgaaaggatt tctcaaggcg tctgttatgt ctgtggtctt ggttctactg tgacattccc 58140 

aattttgtcc tttctccatt atgcttactt tgagcttact gagtgccttc tctcctttaa 58200 

ctctcttagc atcgccatga agtaggtggt attgtatacc catttcacag aaatacagct 58260 

ggtggatgat ggaaccagta cccaagccca tgactgcccg actctaagtc catgctctta 58320 

accaccttga ccttgtcagg cagcttgggt tcccctcata gagactgggt tccaggttcc 58380 

ccttcccagg cagagttgag cactctgatg cccagggcaa ggtgtgagct gtctgtggtt 58440 

ctggggagga acaaggggag atgtgaagga aggacactta gctatcctcc ctgccagggt 58500

ctgagacttc cacctttgag acccctttgg gtgctaagac gctgcctgag gatgaggaga 58560

caccagagca ggagatggag gagacccctt gcagggagct ggaggaagag gaggagtggg 58620 

gctctggaag tgaagatgcc tccaagaaag atggggctgt tgagtctatc tcagtgccag 58680 

atatggtgga caaaaacctt acgtgtcctg aggaagagga cacagtaaaa gtggtgggca 58740 

tccctgggtg ccagacctgc cgctacctcc tggtgagaag tcttcagacg tttagtcaag 58800 

cttgggtgag tggcctatgg ctgaggctga ggtgggagca tggaacgggt gtgggatatg 58860 

cccccagcat tgctatcact ggctcttttt cccattgagg gccctggggg tgtcagtaga 58920 

acctgagcct cagagaggtg ttggggtaag aggggagggc cacctacaaa cagaagttgc 58980 

attttggtct ccaaccttca aatggttgtg gcaggggagg gagggaatga attgtgggga 59040 

ctcaagaccc atgtgaattc atgtaggaag gatgctccat tctttgtctt ttatcctgcc 59100 

ctgtagttta cttgccggag gtgctacagg ggcaacctgg tttccatcca caacttcaat 59160 

attaattatc gaatccagtg ttctgtcagc gcgctcaacc agggtcaagt ctggattgga 59220 

ggcaggatca caggctcggt aagagaagtg tgaacactaa atggggtgca cctgctgatc 59280 

tcagccagca ctcagcttgc atcagatttg tctgtttttc tcctgtataa tctccagaag 59340 

aaccagggat agatggacac ccacagacaa cactgagggg gctgcctggg cattcaggga 59400 

agagctaagg atttagaatc aggaggtttg ggtccaagtt cctttccatc tctcactatc 59460 

tatgtaactt aagttagctg ggcatggtgg tgcatgtctg taatcctagc tacttgggag 59520 

gctgaggcag gagagtcact ggaacctggg agacagaggt tgcggtgagc cgagatggag 59580 

ccattgcact ccagcctggg caacaagagc gaaactccgc ctcaaaaata aataaataaa 59640 

taaataaaat aaaaaaaaaa ttaaaacaag accatgagtt tgtttcctca tctctaggat 59700 

gagttggcaa cccttgttct accttttgtt agggctggaa ggacaagcct gtcactggga 59760 

tgcatagaat ctgatggtga taattgccgt ggatcagcat ttcagatgac taggacagtt 59820 

cccatcatgg tccagcaggg aagggcccat tgcccggtgg gcagcagaaa gagctggcag 59880 

atacggggcc aggtctgctt ctctgccttc cctctgcccc atcccttctt cccctcttgc 59940 

tttctccagg gtcgctgcag acgctttcag tgggttgacg gcagccgctg gaactttgca 60000 

tactgggctg ctcaccagcc ctggtcccgc ggtggtcact gcgtggccct gtgtacccga 60060 

ggtgaggtgg ggctggggat gaacgatgga aaggtctggg agatgggaag tgccccaagg 60120

aggagatgct acaaagagcc tgaccctttg tgggagaggc ttcctgggtc ttttatatac 60180 

tctgactcca cagcagtgtg tgggtgggaa aagaggccct cctgtgggtt gagttgggat 60240

ggacaagagg ctgaaagtcc ctttctgttc tgccttcaca ggaggccact ggcgtcgagc 60300 

ccactgcctc agaagacttc ctttcatctg ttcctactga gctggtccca gccagcagtt 60360 

cagagctgcc ctctcctggg cagctgcctc ccctcctctg cttgccatcc ctccctccac 60420 

ctccctgcaa taaaatgggt tttactgaaa tggatttatt ttctcctctg atcgcggatc 60480 

cactctgctt agccctcatt gaaacttctt ccttatcatc tctccccaca ccacaacttt 60540 

catagaagtg tcagaagcta ctactccttg aggaggagga tggagggtgg agttgggtct 60600 

atggagcctt ttggagatgg aggaatgggc tcagctagtt ctcttcatag aacacctgat 60660 

tactgggcac ctgcatagtg ctgccaggac ctttcaaggt tgtaggtaga ctcccaatgg 60720 

cccagtttgc atctctgtaa ccaaaggcct tttctctctc tctctccaac cccagaactg 60780 

tggttggttt tatatgtaag gaagttaaca tgtccctggg aacagtccac aacattcagg 60840 

aatgaatgta taagtaccgc aatccccggc ccctcaagtg gaataaatct aacatgtatt 60900 

gggcaccatt tcccagtggc ctgctgtggt agttggcctt attccatgca tttttatggg 60960 

ctgccttccc ttcctcaact gcattctctg ctccttccta ctctctgcaa ctcccaaata 61020 

aacacttgta cgcaactccc tctctcagga tctccttctg gggaaacctg atataagaca 61080 

gcttgccatg cgtcagactc tgaatgaggc ctgggaatac aagacatagt cctctggcac 61140 

ttgggatata tggttatttg taacataggc acaaaaacat ctactagttg ttatcgctta 61200 

ttgagcaccc acaacatacc ccctgctgtg gcaggcacct tgcctagatg acctcatgtg 61260 

atcaataatt atgagcccta ttttacagaa ccaggctcag agaagttagg atctgtcaaa 61320 

agacttgccc aagactgaac ctctaaatgc aactcatatt gaaattcaac tctgctccaa 61380 

agcatgttac tttaaccctt gtgcttttac agctggctac tctcccctta tggtcacacg 61440 

gggatgaagc acggggggag gaaagccaga ctgtctcact cttgggttca tcttgggaca 61500 

caggacacca gcccagctgg aggtgaggga gctttaatca gaggggaggg aggaaggcat 61560 

tctcaacccc ttctgtacta gggaggtcag cagaagaaaa taattcaatg ttctaaagcc 61620 

atttttttct ccagcattcc tccaattcat agatcttcat atgggattag gggctcagag 61680 

aggggtgaaa caagaactct atttttttgg agtgtggtat agagaaggga tgctacttct 61740 

ctaaggtcac atagtaagtt gagaaagaga gagaaatcaa actcaggttc atttcaacta 61800 

ttgttccaca agaatctgtt gatttcaaag atggtggact atgggttcat ccctgtggtg 61860 

agtgctgtga ggatgcagct gaggtggaac tttcactcct tgccctcttg gactttatat 61920 

tctggtgtgg aaaggcattg cttcccttat ttcaatatta acaacaaagg gtaataatat 61980 

ttcccattta ttaagcattt actaggtgtc aggtactgtg ctaaatgtta ggtgaacttt 62040 

gtcttgttcc tcataaatct ctgccgctgt gggtgtgtac tttgacagaa gtttgacttc 62100 

cagtccacag agatcttctt tgggggagta atatcaagaa ggggcacgaa ggaagctgca 62160 

gggctcctag tcccatcctg tatctcgacc taggcatgtt tacattggtg cattcactgt 62220 

gaagtttccc tgagcagtcc actctatagt gtgctttata ggagcacatt gtacatccat 62280 

tgaaaaattt ttcttggccg ggcacggtgg ctcatgtctg taatcccagc actttgggag 62340

gccgagacag gcggatcacc tgaggtcggg agtttgagac ctgcctgacc aacatggaga 62400

aaccccgtct ctactaaaaa tacaaaaaaa ttagccgggt gtggtggcac atgcctgtaa 62460 

tcccagctac tcaggaggtt gaggctggag aatcgcttga acctgggagg cgaaggttgc 62520 

agtgagccga gatcgtgcca ttgcactcca gcctgggcaa caagagcgaa actccgtctc 62580 

aaaagaaaga aagagatttt ttctttttct taaaaagtaa aaatcatgaa ataaggggac 62640 

tgggctaata ttccaaaata tgggtttgtg tgtgaatttt cctctccagt aagatactaa 62700 

ctaagctctg tgaaactgtt tatctatggt tctttatcat tgaatccttg gagttcctta 62760 

cactgtgcag agcacagagt aggggctcaa tcaacagtgc actcattgct ttttcataga 62820 

caagggccac cctcactcaa ctcatgtgcc aggcatagtt ctgagagctt tgcttaagct 62880 

gatnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 62940

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnggtcact aatttggtat 63000

agagttatgt tcattgattt cattttattt tgtttctgat ttttaaagat tgttttactt 63060 

gttttcttcc tattattatt ttattttatt tgtaaaacat ttacatatca gacatttaca 63120 

ttttcccaaa ggtaaaactg tgaaacaaga tatattcaaa gaagtttact ttccctctct 63180 

gtttcttgta ccccttttcc tcttctttag gtaaccattt ttattttttt aaatataaac 63240 

attgtgtagg tgtatataca tgtattagtc tgttttcatg ctgctgataa agacctatct 63300 

gagactggga agaaaaagag gtttaattgg acttacagtt ccacatggct ggcaaggcct 63360 

cagaatcatg gcaggaggtg aaaggcactt cttacacggt ggtggcaaga gaaaaatgag 63420 

gaagaatcaa aagtggaaac ccctgataaa cccatcagat ctcgtgagat ttattcacta 63480 

tcacaagaat agcgtgggaa agactggccc ccatgattca gttaccctcc cccactgggt 63540 

cccacccaca atacgtggga attctgggag atataattca acgtgagatt tgggtgggga 63600 

cacagccaaa ccatatcaat acatttccct ctcttttaga taaaaggtag tatactgtat 63660 

acactattct gcagagtttt tttttttttt gatgtaactc tatcctgagg gtgctctgta 63720 

gcagggacct ctcatgcctt ttaaccactg cctgggtctc cattacatgg ctgcagcata 63780 

gttgccacag cattcctgta ctgatgacta tttggattgt ttccagtctt ttgctattac 63840 

cagtagtgtt acaaagagga tctggctaca tgttcagggt ggggaggggc agatgtgtag 63900 

cctgtcagga gggtattgca gtaatccatg actgagttaa tggtagttta aagctaggat 63960

gagtcagtgg ggttggagag aagtgggcac atttgaatga tatgtaggag gtgaatgatc 64020 

agcattattg atgagtttga ggtggggcat gtggggaaag gattcgagga tgactcccag 64080 

gtttctgttg ggacagtgga tggatagtgg ctcctcccct ttttccaatc ttccttggcc 64140 

cttcgctgac ttctgttggg ttggcctaca gagagcttct ttttcctctc tgttcgccca 64200 

ggttcctcca ctttggcggt ggccctctgc tcgacggtgc cttcgctggc cctgacatcc 64260 

ctgctgtgcc tgggcttcgc cctctgtgcc tcagtcccca tcctccctct ccagtacctc 64320 

accttcatcc tgcaagtgat cagccgctcc ttcctctatg ggagcaacgc ggccttcctc 64380 

acccttgcgt aagtggcctt ggggcgggct ctgtggagac ggacacactg gggcaaagag 64440 

aagctggagg taaagaaatt gggaggcaag gcggggcctg gaggcagtca ggtgcgggag 64500 

actgggtttg ggggcaggtg tggagggggt gagaccagag gtggtgggaa ggatagaaca 64560 

ttcatgcact tgagccttta catctgcggt gccctctccc tctgttttct acctggtgaa 64620 

ctcgtattca tcctctgagg cccacttctg tttcagttct ccagggaaga aatggaaaag 64680

tgtcttccct tctttgtgcc cttagtactc tagtcttact tcctttgcta gtgcgtgcat 64740 

tgtctggcat gccatccatt tacatgcctg tcttttcttt cctggtgcag cctgcatgag 64800 

ggtcctgtct gtttttccag ggccccgcat gtgccttctt ctgggttctg tgggtcaaat 64860 

gtctgagcag agctgaagag ggaaaggcca gacaggtgtg gttggagggc aggcctagga 64920 

caggggagct ggggacaagc ggccgacagc ccccagaggc caggcttctg cttggaggga 64980 

gggtccctga agctcactgg aacccctctg gtttctctcc ccagtttccc ttcagagcac 65040 

tttggcaagc tctttgggct ggtgatggcc ttgtcggctg tggtgtctct gctccagttc 65100 

cccatcttca ccctcatcaa aggctccctt cagaatgacc cattttacgt gagtactggg 65160 

aggatgggga tccctggcag gaggcctggg ccttaggcct tggctgcccc aaatctggct 65220 

gtgatggcct gggtatgtag catggtgcag cttcccaaag ggtctgtgtt attcaagtat 65280 

ttggggcaaa agtatttgtg tgtgtgggga aacagacatt ctggagtagg gtggggaatt 65340 

ctcacgaaac ttcaagcaaa atcctgagac ctcaaaggtg tttcctgctt gtggtgagtg 65400 

caggcccacc ctggcctctc ccctaggccc acacagggtt tccacagttg gccccaggga 65460 

caggacctct gtgctttcac ctctgtgtcc ttacacctgg agggatgctc tgaggtcctg 65520 

ctctaggagg tggtcgtgag tctcctgctc tttgcagaaa ctgaggctca aagaggttac 65580 

ttacgtgttc agaggcacca gctaaggagc aaaagtcaac tttgaattct gtgttttgac 65640 

tactgcacag ctctatttgc ctcatttttt atttttaaag cagcaaatct tagaatagga 65700 

gtttaaatcc atcacttgga gaaaagaaag actaaatgtt ttttgttttt gttttggaga 65760 

cacgatcttg ctttgtcacc caggctggag tgcagtggca caatctcggc tcactgcagc 65820 

ctcgatctcc tggactcaag cgatcctctc atctcagcct cctgagtagc tgacactaca 65880 

ggcatgtgcc accatgccaa gcttatttta ttttattttt ttgatagaca ctggggtttc 65940 

gctatgttgc ctgggctggt tttgaattcc tggcctcaag cgatccaccc gtctctgcct 66000 

tccaaaatgc tgtgattaca ggcgtgaacc actgtgcatg gccaaaagag taaacttgaa 66060 

atctgaggcg aatgacttga ttgtgacatc aggtgaccta gtaatcagct gtgtattcta 66120 

gctggtgcct ctaccagctt cccatgtgac cttgaacatg tcattgaatg ctcgctaggc 66180

ctctgtttct ttatctgtga aatgggcttg atattcctcc tctaccccaa ccgatagtgc 66240

agaatgaaaa gtaactgaaa gtccttcctc cagggcacca tagtgtctgg gtgaaaagta 66300 

gaatataaac tcggtagact tctggtccct tcattggtca tggaatggac cagtgcttgc 66360 

ttcattgagc aacagttctg ttgttcagaa ttcctggatt tcacctcact tctgctctcc 66420 

ctgcaggtga atgtgatgtt catgcttgcc attcttctga cattcttcca cccctttctg 66480 

gtatatcggg aatgccgtac ttggaaagaa agtccctctg caattgcata gttcagaagc 66540 

cctcactttt cagccccgag gatggttttg ttcatcttcc accacctttg aggacctcgt 66600 

gtcccaaaag actttgccta tcccagcaaa acacacacac acacacacac acacacaaaa 66660 

taaagacaca caaggacgtc tgcgcagcaa gaaaagaatc tcagttgcca agcagattga 66720 

tatcacacag actcaaagca aaggcatgtg gaacttcttt atttcaaaac agaagtgtct 66780 

ccttgcactt agccttggca gacccttgac tccaggggag atgacctggg ggaggaagtg 66840 

tgtcaactat ttctttaggc ctgtttggct ccgaagccta tatgtgcctg gatcctctgc 66900 

cacgggttaa attttcaggt gaagagtgag gttgtcatgg cctcagctat gcttcctggc 66960 

tctccctcaa gagtgcagcc ttggctagag aactcacagc tctgggaaaa agaggagcag 67020 

acagggttcc ctgggcccag tctcagccca gccactgatg ctggatgacc ttggcctgac 67080 

cctggtctgg tctcagaatc acttttccca tctgtaaaat tgagatgaat tttggtgttg 67140 

aaagttcttc ctggagcaga tgtcctagaa ggttttagga atagtgacag agtcaggcca 67200 

ccccaagggc catgggagcc agctgacctg cttgaccgaa ggatttctga cagactatct 67260 

ttggggatgt tttcaagaag ggatataagt tatttacttt gggcatttaa aagaaaattt 67320 

ctctcgggaa taattttata gaaaaataaa gcttctgtgt ctaaggcaac tactgtttcc 67380 

atctctctag gctttgggcc ggggctgtgt gtgtgtgtgt gtgtgtgttt gtgtgtatgt 67440 

gtatgtttct gaggaggccc taccctggca tgagagggta gggaatctgg ctacacatct 67500 

agtgtggcag ctggacccag aggtggggca ggaaccctga ctatgattca ccccgctggt 67560 

cctgggatgt gggcccagag acttcctccc ccaggaaccc ctctgcttcc tcttcctctc 67620 

cacatcctta actaacttta gcagaaccct actcctcact acacaccccc agctagaagc 67680 

gctggatgga atcagaaatt cctagtttga gtttcaattc tgcccctcag cagctgggca 67740 

agccccttaa ccactctgag tcactagttc cccacctgca aagtgcagtt aatcatttct 67800

atctctgatg gcgattgtga gaatgtaaag tcattgcaac tgcctagcac atggtaggag 67860 

cacatgaggg tttgctcctg tgtttactca tgacccttgg ggaggacggg ggcaaagagg 67920 

gagaagttga gggtgcagga ggagagatgg caggtgggtg ggatgggaga atctggggca 67980 

cacctgctgt ctcattccca ccttgctagg agagggacta ggaaagaaca gtgggaggca 68040 

gggggatggg ggtggaaggc agggggtggc aggcaggttc atccatccat tcattcaaca 68100 

aatgtttatt gagcacctgc cacgtgtcag gccctgtcct gggtgctggg gctataaaga 68160 

tgcagaaggg tctgaaaccc agctcttcct tcttcctgtg gatgtcgggg tgtaatttcc 68220 

aggggccagg agcctgggtc tgagggcgga caccaaagtt ctagtggtgt ctattagcag 68280 

cgtttaaatc taatggatgg atttggtctt gttaccctgc tcaaaagctt tcagcagctc 68340 

cccactgtcc acaggacaaa aatccagatg ctagcctggc attcaaggct gtcactagtg 68400 

tgatctcaac ctctcccctt ccctctttac ctcctaccaa cagcggggca gagcccaccc 68460 

ctgtggacca agattcccag tctctgggtc tgtgtgtgca ccagttcctc tgcgtgggtg 68520 

gctcaccctg cctcagcttg tgaaatccat ctggtctgct gggatcctgc tcaaaatgtc 68580 

atcttctcca aaaatcatta ctcaggcttt ccagcatgtc tgagtccctg gcacttggtc 68640 

acacccttcc tggtgactgg catttgcctc cacatcatga ccctcccacc ccttgcctgg 68700 

gcagcatact ccaggaggca aggtctgttc tcgcctggct ctaattaatc tgtgcttacc 68760 

atccacatgg taccagctaa ttcttgttga atgaatgatc gttgaatgag tggattcttg 68820

ttttggcctc agaaccaatt agaaggagcc agaaaaacac atgggggtgg gggaggtgca 68880 

gtgtggtgca gtggaaaaaa acccttctgg aaatctcagc tctgtcactt actttgtcag 68940 

ctctgtgact ttggatggac cacttctttg tcagtatggt gggagaaata gacatgcctc 69000 

tctgggctgt tgtaaggatt acaaattagg tcgagtgctt ggcatgtggt gggttgaaca 69060 

gatcacagct agcattacag atgatatatt aaagccaaaa aaagatgcct aatgtccacc 69120 

agttggtgaa cggacaaagg aaatgtacca tatttgggat attatttggc aatcaaaaaa 69180 

agtactgaca cctgctacaa cacggatgaa tcttgaaaac attagactaa gtgaaagaag 69240 

ccagacacaa gaaactgcta atgattccat ttaaatatga aatatcgggc cagggtgcag 69300 

tggctcatgc ctgtaatccc agcactttgg gatgccaagg tgggcagatc acttgaggcc 69360 

aggagttcgt gaccagcctg gccaacatgg cgaaaccccg tctctactaa aaattagccg 69420 

agtgtagtgg catgcacctg taatcccagc tacttggttg gctgaggcac aagaattggt 69480 

tgagcctggc aggtggaggt tgcagtgagc caagatcgtg ccactgcact ccagcctgga 69540 

tgacacagtg aggttccgtc tcaaaaaaaa aaaaaaaaaa ggaaaaagaa aaaaagaaat 69600 

ttccagaata ggccaatctg tagaggcaga aagtagattc atgattgggt aggcctgggt 69660 

gtggaggcca tgggtagtga tggctaatgg ggaaggggtt tcttttgggg tgatgaaaat 69720 

gggtggactt atggtatgtt aattatacct caataaaact gttatttaaa ggaagaaaag 69780 

atgcctggat tccccaggaa gtgtacagta gacttctgtg agaatcagaa atgatttctg 69840 

gggaagatgg gcgagaggag agtaagtggg agaagtgacc acgtgcgcaa ctctcatcgt 69900 

tctgccctga gagccttcct cctgcaactt tatttattta tttattttga aacaggttct 69960 

cactctgtta ccctggctgg agtgcagtgg tgtgatctca gctcactgca gcctcgacct 70020 
gccaggctca agcaatcctc ctgtttgagc tcctgagtag ctgggactac aggcgcatgc 70080

caccacatct ggctaatctt ttatttattt atttatttat ttatagagat tggggagtct 70140 

cactctgttg ctcaggctgg tgtcaaacgc ctggactcaa gtgatcctcc caccttggcc 70200 

tcccaaagtg ttgggattat gggtgtgagc cactgtacct ggcacctcct gcaacttctt 70260 

cctcaagtgg aaccaatgag gaagcaagca actcagagct ttcacaagtt ttgatttcaa 70320 

tcagcaacgg gcttccaatg caacccttct ctcctgtaac cagcctcagt agagaggaac 70380 

tggaggtgaa ttggccccca tcacaccccc acagtgccaa gctgggccct tccatcaggg 70440 

ggagaacaca tgccgtgtaa gggacagcca acagcataaa ataggaattg tgtgatgatc 70500 

ccttttaagc ctattcagcc cagggaagtg catatgatca gccccatttc atagatgaag 70560

aaagtcaggt tcacccatta gcacattgtg gggctggtat ttaaaccagg tctgtctggc 70620 

tcccaaggtc acattcattt agacattacc tttactttac atttcttctt cttttcttct 70680 

tcttcttctt cttcttcttc ttcttcttct tcttcttctt cttcttcttc ttcttcttct 70740 

tcttcttctt cttcttcttc ttcttcttct tcctcttctt cctcttcttc ctcttcttcc 70800

tcttcttcct cttcttttct tcttcctctt cttcctcttc ttcctcttct tcttcttctt 70860 

cttcttcttc ttcttcctct tcttctttct tcttcttctt cttttttttt tgaggtgggg 70920 

tcttgctcta ttgcccaggt tgaatgcagc atcatcatac ctaaatgcag ccttgaactc 70980 

ctggccttaa gcaatccccc tgcctcggcc tccaaaagtg ccaagatttc aggcatgagc 71040 

caccatgccc agcctgcatt tattctcttg taagaaagat atcatttaaa acagacgaga 71100 

aaataaagag ggacatgaaa aagacgcatc accattaatt ggaccactca gagataatca 71160 

tggttaacat gttggtatgt tccctcccgt catttgactg gatgtatgtg ataatttaaa 71220 

tgatctcata agcttttcct tatgtaatca aatagtagcc aaaaacatga ttttaaatgg 71280 

ctgctcacaa ccccatctcg tggttctgcc acgccttgtt tatccccatc caccccctac 71340 

tccctttccc cttccctgcc tgtgtggggg tcctagatga cggtgagcca gagggcagcc 71400 

ttggtcagca gattggagag tgcaaataat aaaaacactc agaaggcgag ctgttgtcaa 71460 

gtgggcttat cacaaaagag caccttggga tattccagag aatgacctca tacccgctaa 71520 

tcactatcca taatctggtg ctaactgtac tttagctgaa ggtgctggca ggtcctgccc 71580 

aggtgctgct aagaacactt ctattctgtg agaatcagag atgatttcta gggaaaatgg 71640 

gcgagaggga gtaagcagga gaaacaaccc acaggcacag ctctcatctt tctgccctga 71700 

gagccttcct cctgccacgt ggttttgttt gtttgtttgt ttgtttgttt cagatagggt 71760 

ctcactctgt cacccaggct ggagtgtagt ggcaagatca tggctcactg aagcctcgac 71820 

ctcccaggct caagcagtcc tccccaaatt caaagcttgg agtgatggtc ccagtggtta 71880 

tgtctaggag ccctttttcc tgccagcccc tcaggggatt gatgactctc aaatgcttca 71940 

ggtgtgacat gggcacagca gtgagtcatt cctctgacat tctttgggaa gaacattttc 72000 

catccaggct tccaggcata agatccagtc ctctggtgat aaggagttca cagacaggac 72060 

aatgtctgag tgtatcttaa acccaggacc atggcttgtg ttcacaccag accctccagg 72120 

gattttgagg tgttttgttt gtttgtttgt ttgtttgttt gttttttgag acagagtctc 72180 

tctctgtcgc caggctggag tgcagtggca cgatctcagc tcactgcaac cttcgcctcc 72240 

cggttcaagc gattctcctg tctcagcctc ctgagtagct gggactacag gtgtgcacca 72300 

ccacacccgg ctaatttttg tatttttaat agagactgtg tttcaccatg ttggacagga 72360 

tggtcttgat ctcttgacct cgtgatcctc ccgcctcggc ctcccaaaat actgggatta 72420 

caggcatgag ccaccgtggc ccgcccaatt ttgagttttt atgttctaat cccaaacatc 72480 

tgctcacagg cccctcagca tattctttcc tgggtccagt gtcacctccc aggcctgcag 72540 

gctggctaga gcagtagggt gtgtgggaaa gctctgggct ttgcaggcac tgatcagctg 72600 

tgtgacctta accaccctga acctcagttt cctcacctgt aatggaaata ggtaccacgg 72660 

cagtttgttg caaggactag agagtaacct tgggaataaa aggtagcagc agcttgggct 72720 

ctggagatgg actgtccaag accaacttcc agttcctccc cacacaagct ctggcactta 72780 

gattcctggt acctccgctg cttcatctgt aaaatggagt aacaatagga atactttata 72840 

gagttgtaag gattgagtgg ctggatgaac gtcaagcact tcaaagggga cctggcatgt 72900 

agtgagtgat caatataaac cacctggctt gtagcaggtg tgctgtgtgt ggctgcaggt 72960 

gttattagta acatctgtgt gcccttcaga gcgtgcacca cacttcacac cttgtggagt 73020 

ctggaatgcc actattatag ttcaggatag aaaacctccc tgcaagcact cgctttagct 73080 

tgtctccacc gaacaaaaca acacaagttc tttattactt ggaatgggaa aacttcaaag 73140 

gcaaaaaaaa aaaaagactt tcgagttacc ccaaatctta agccaaagtc aatgaaaaat 73200 

atcaatcttc atattcaatt tttgcgatac ttttgtctcc ccagcagtca atggagagaa 73260 

tccaagcaca cagaaatgtc aattaccagg ggcagggcta tgaattcctt tcagagccct 73320 

gggctgggga agagtgcagg cagacagatc tgggtcctgt tatcacgttc ttagattggg 73380 

tgtccttgta ggagtcatga agcatcttag tgcctttgtt tgctacctat aatgcctacc 73440 

tcagagagta ataaggataa gtaaggctct acgtgaaaag tgctcggccc tggcacatag 73500 

taggtccttc attaatggca gctactaatt tttattacat acgcaaaatc acattacagg 73560 

tcaagtacgc tacatgacag tgaaacagtt tttttgtttg tttgttttga gacagagtct 73620 

cgctctgtca cccaggctgg agtgcagtgg cacgatcttg gctcaccgca acttctgcct 73680 

tcaagcaatt ctcctgtctc agcctcccga gtagctggga ttacaggcat gtgccaccac 73740 

gccagctaat tttttttggt atttttagta gagacggggt ttcaccatat tggccagact 73800 

ggtctcaaac tcctgacctt gtgatctgcc caactcagac tcccaaagtg ctgggattac 73860 
tggcatgagc caccgcacct ggctgtgaaa cagttttatt gtgtttctgt ggaatgtgtc 73920

ctacccaacc tatagctaac tcctatagtt ccctcagttc tcagctcaga tatcccttcc 73980 

tttctgtact gttacctagt actggttttc atagcaccag gtacctctct ggcatagagc 74040 

ttgtcacagt tgcagtttaa tgtaccatca taggatttta aaaatattca gttgtgtctt 74100 

ccattaggct ttcatttggg aactccacgc aggcagcagc tgtatatttt gtattgccta 74160 

ctgtatcctg agaactttgt accctactta gcacagaatg gaggctcagt aaatactgga 74220 

catgagagag agagagagag agagaggaga gggagagaga gagagagaga ttcaacctac 74280 

aatcccagct ctgagcttct agttccctga tggtgaggac tgtgatgtgt ctcacacggt 74340 

aatgagcact tatgcagaag aggctcagaa aatttctcct catggccaac ggaagactta 74400 

gagttctttt ccaagctcca ccgtttgctg gcatgcaaaa tttggactat cacttaagtt 74460 

ttccaagcct tgctttttct atccctaaca taggacaata ttcagcattg ttgtttgttt 74520 

gttgggggca ccatgtttca ggcacttagt agattattgt accaccacat ttcaattggt 74580 

cctcctcaag ccctgcaaca tctgtgaggt ggtcatcctt aacaactcac agatgagcaa 74640 

caggagactg gggggatgag ggaactgcca aggaggtcca gcttatgggc agcagagcca 74700 

agaatggaac cagggtcttt tattttttta tttttttatt tttatttttt aaccagggtc 74760 

ttttaacatc cgaggaccac attctttgtg ctttccaaat catcacctgc cccatgcaac 74820 

ttacagggta agttacatta aacaacgtat gtaaatggct ttgtgctagt tattcaccac 74880 

cacaggggaa gtgagtcacg gacaagagtg cagccgctcc attcggatcc tggctctgac 74940 

acttacctgg aaaatgactt aaccattccc aggatcagct gtttgtctgt aatttaggta 75000 

gtttaatggc acttgtgtcc tagagttgtt tagaaggttg aataatatgg agcacttaac 75060 

atacttagca cctagaaaca cttcctaaat attagttgct gctgttgtta tcgttattaa 75120 

aatttctgcc taagatctca tttcagggag cccaactcaa tctttgacaa gcttaaacaa 75180 

aaattgcttt tcttcattta ttcacttaca cagcaaacat gaattgagcc tgtactgtgt 75240 

ttccagaact gtgcaggacc agagaggcac aggtgaagga agcaaggctc tggctctact 75300 

ggggaaacag caagaagatt gctacaatga ggtgggaaga gggctggact agagagaagc 75360 

cctgattagt gtccttgcta cctttctctg ggagagccaa ggcaggcttc ctggaagagg 75420 

tgatccttgg ctgaaacttc gatgaagaaa aggaaagagc gcagtggtta gggaggaaag 75480

ggcattctgg gcagatgaaa tgacatgtga caaaatatgg gtgatcannn nnnnnnnnnn 75540 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 75600 

nnnnnnnnnn nnnnnnnnnn nnnnnnngca ctccagcctg ggtcactgag tgagagaccc 75660 

tgtctcaaaa aaattaaaaa aaaaaagtcc agaagaacat ttgggtctca ctctgtggcc 75720 

caggctggag tatagtggca caatcatagc tcactgcaca ttcaaactcc tggcctcaag 75780

tgatcctcct gccttagcct tgaaataagc ttggattaca gatgagccac cacacccagc 75840 

cagaccatta ttcataatag ccaaaatgtg aaaacaaccc aaatctccat caactgacaa 75900 

atggataaat agaatggtgg ttgatccata caatggagta tttactcagc aataaaaaga 75960

agtcctgata catgctacaa ggatgaacct cgaaaacatt atgctaagtg aaagcagcca 76020 

atcacaaaag gctacatatt acaagattcc atttaaatga aatgttcaga ataggtaaat 76080 

ctaactttta tcacaggcaa agctatgaca ggaaatagat gagtggttgc ctagtgcttg 76140 

ggggcagagg tgggggtgag gcgagtgagt actgctaatg gtacagagtt acttttgggg 76200 

ataaagaaac tgttctgaaa tggactctgg tgatggttgc actactctga acatactaaa 76260 

actgttaaat tatatacttg aaatgggtga cttgtgaggc atggaaatta tatcttaata 76320 

aagctgtttt acatatttta catatttaaa aatgcaggtg gagggatgag ccctctaaag 76380 

agaagcagga gtttgaggag gttctaaata ttgtgtggtg ggtactgagg catataaatt 76440 

tgtcagacct catcaaaatg tatgatgtaa tcttcaagaa agttgatttt aaagaaacac 76500 

caccagcacc aggtggagaa ggcaggaaga agttacacaa ggggtaggcc aagagtggtg 76560 

gctcatgtct ataatcccag cactgcggga ggccgagctg ggtgggtgac ttgaggtcag 76620 

gtgttcgaga ccagcctggc caacatggtg aaagcccgtc tctactaaaa atacaaaaat 76680 

tagccagacg tgctcgcgtg aacccagggg gagaaggttg cagtgagcga agatcatgcc 76740 

aatgcactcc agcctgggtg acagagtgag actctgtctc aaaaaaaaaa aaaaaagtta 76800 

cataggggac agtggcaggt gtcaagggca ggcagggtct ctcctatctc caggataaac 76860 

tcatagggga cttagatgcc atgtgggtcc ctaatagccc tccacttggt tcttgcagcc 76920 

actcttatgt gtatcatttc atgtcaggcc tcttcttccc aacccaccca gccatcccag 76980 

cctggctgcc aaccccacct cctccagccc ctgtcacccc ataattgggg ccaggaggca 77040 

tgggagagtc gccatctctc ggtgccatct gttgcatctt tacagataac catggctgga 77100 

tgcggcagat cctggggtgg agcagccgct gttcagagca gtgatcaaga cctccccatc 77160 

tccacccctc aaggaatcgg ttttcttcca tagccacatc aggtgctgtg caggaaggag 77220 

ttgaaacgag aagccaggag caacgagaag gacactaaca tttattaagc actgcagact 77280 

ctcacagcac tcccacggaa tcgatattat tatccccatt ctgaagacca ggcaactgaa 77340 

gctcaatgtt taaggaactc accgaagtca ccaactgata aaagtgatgg aagctgggat 77400 

tcaaatccaa gctaaacttc cttccaagct tactccacaa cacagaggtt ggggaaaggg 77460 

gataaaaaga gaggggagcc caattccatt tccacccagc tcctgaggcg gagcttgtca 77520 

gcacagctct ctccttccca gaataggaag atacccatca gaggcaagtc ctagacacca 77580 

gcagtggtaa ctccctgccc caaggcagct gcagacagcc tatggctgta gttactgctc 77640 

ccaaagagtg ttagaattcc cactcccagc ttcggggcca ctcacacaag gtgattgaag 77700 



tggaaaccag agactctcca caatgccctc ctagagtaaa tgaggctatg taactttgtc 77760 

caaatgagta atttgaaaac ctgggggctc ccagctcctg aaaagggaag gatgtggggc 77820 

cctttatatt catactccac tttgtgcagc tctcccttgt cttatgatag ccctattaag 77880 

aaattcctct cccagcacgt ctccttcaaa gagctctaga cctgaggctg tcagaggctt 77940 

aggactctgc ctattagtcc cagggtctgg atgaccagca ggacacctgg cattcagtga 78000 

ccactggatt agataaatga aacagtgggc agagtgccac ccaatctccc cctgaagttt 78060 

gaagaggtcg agaagtgagg ctgtccaact gctgaccctg ctttctgtcc acctggccac 78120 

ctaacctttt ctggcttcca cctgcccctt tgccatccct ccccccagcc cacccagccc 78180 

attttcaggc atacctgggc acgtgctgga atagaagccc tcgttcttca gaatgatcaa 78240 

cagggagccc cagcccagga gtacagcaga gaagaagagg ttctccagca cagccgtgca 78300 

ggccatccac cagcgcctcc ggtacgcctg ttgcagcgtg ggggccatgc tggccccgag 78360 

cctgcacaga aacagagcgc tgggtgaagg gccccccagt ggccccaggg aagggtcctg 78420 

catcatggtg gcacccgaga cctctcgggc cagcccgcga ggagcccctc atggaggccc 78480 

catagagccc tgggcttccc agccggtgcc aaggagctgg ctccgcgcgc actagcagtg 78540 

ccagaggtgc acgcggcacg gggctcccgc tgagccacta tcggaaacaa ggaaggtcct 78600 

gtctgcgcgc tgcagcttcc tagcaggctg ccgggttctc tcacccaggc cagggcgctc 78660 

agggccgggc tgctggggag aaagtccgca tctgcccagg tccccagagg acagcaaggg 78720 

gcagagcgcg ctctgaagca ccgcgggccc atgtccggac tctcgcgcca ggaaagaccc 78780 

ctagaagctg gcaggaagaa gggcaagttc aaggctaccc tacgacccca tcttccagtt 78840 

gcccctccaa gacctctcct tccctctggg gccgggcgac agcaagccct ccccctttcc 78900 

gtatcaggtg acccacgacc ctacagtctc tcgggccaag ccaacagctg ccacgtggag 78960 

ggagacccag gacgggctct cctcggttcc ctcctccccc gcgcgcccct cactcactcc 79020 

gcagggctcg gggcaccagg ctttgcacct cggaacccgc ttgcccccct ccagccccgg 79080 

gagggggctc ggacttcggc aggaagtctg gcggctgctg actttataag ggcagcggtg 79140 

gcggatgggc tggcgggcgg gtgtgtttac caaagggagg gaaagagccc cagctccccc 79200 

cgccgcggcc gctgcagcct cggcgggagg agagggaacg cgggcagcgc gggggcgggg 79260 

agcgacaact gggatgagac cgaggaaagc ggagaggaga agggcaagaa agacccagag 79320

agaggggagg aagtaccagt cacttcttcc agggggactc ggtattctca tctgtgaaac 79380 

ggggctttgg gttcaagcgc tccaggaggt ccgctggaac tctggcaaac gcgcagctct 79440

aagcagagga agtgcagcga gcggggaccc gggaggaaga gaagagtcgg aggggtcaga 79500 

gaaaagaaaa gggaaggacg cgcttggcga gatgggacac tgtgccgcgg gaccgcgggc 79560 

gcaagtaacg gtctttcctt gggaagcctg gcagtgtcgg cgggagccgg cctcggtgtc 79620 

tctcagccga cgcatagccg gagaccctac gcgcgccccc tccccgccca cgctgctcac 79680 

ctccggtcac cggcaaatga gcagccagca gctgcggacg cctccgggag cgcaacgctt 79740 

tcgcggcgcg tccggagtcc cgtgggccca gccctgagcc gcgccggcgc tggggtcttc 79800 

tctgcgtgca ggacccggcc gccacggagc ttcagcctga cagcccggtg gcctcgcctc 79860 

cgctgtctcc tcggaagaag cgggggaact gggaacccgc cgggcgccag aggtctgcga 79920 

agctgggctt ggatgaagtg gatctgcgga gttgatagtt gtatttacac gcgtccggag 79980 

ctgcgccccg aggtgggggc gggggctccc ttcttttccc ctccccttag gtcgagtttc 80040 

acgcgcacgt gactcgcccg ctggtcccgg acactctccc tctggcacag ccccagcacc 80100 

tacatttcca ccctggaccc ccatcttctc ccccaagccc ccagactaac atcaggcagc 80160 

gccctctgta tccttgttca aaacaaagtg cgattcggct gaagccgact gaccgcgatt 80220 

cagggccgcc ttgggtgggg ttttgaactg tgcagctgga agcagtgttt tccgagaggc 80280 

agagtggcac gggtttcttt ggagttagtc agatcgaggt ctgagtcttg actttttaac 80340 

tgactaccct gggttaccta gggcaagtta cctctctgag cctcagcttc ctcctcttta 80400 

aattcggtta aaatggaacc tacctaactg cccaaaggaa tcgcgattgt gatgcaggta 80460 

aaatgctaag catagcattt ggcatagtaa gcataatgtt aattgttgct gctgtcatta 80520 

tttcagaaga cctggtgatc ggatgcttcc agatcaacaa ttgattgact ccaggtaaat 80580 

ctctcagcct ccctgagcct cagtatcctc atctgtaaaa tagactacta tggtgtggag 80640 

taatgagaag taatctcatt acatgtgagt ttaattgtgt gttaagagtg ctgctaatgc 80700 

atgctgagct taatacctag gtgatgggtt gataggtgca ataaaccacc atggcataca 80760 

tttacctacg taacaaacct gcacattctg cacatgtacc ccagaactta aaataaaaat 80820 

aaaatttttt taaaaaaaga gtgatactgg tggccaggtg tggtggttca tgcctgtaat 80880 

cccagaactt tgggaggcca aggcaggagg atcgcttgag ctcaggagtt cgagaccaac 80940

ctggacaaca tggtgaaacc ccgtctctac aaaaaagaaa aaaaaatagc caggcatggt 81000 

ggtgtgcacc tgcagtctca gctacccagc aggctgaagt gggaggatca ctgagctgga 81060 

gagatggagg ctgcagtgag ccaagatcat gccactacac tccagcctgg gtgacagagt 81120 

aagactctgt ctcaaaaaca aaacaagaat gactacagaa agctccaaga aggcctcaga 81180 

taaaagggaa cccctgaaca gatgagccac caagccaaga gaggaactaa tggctaccat 81240 

agacagggca ctttccaaaa taaaaatact gttattaatt cctcaagaca tcatggtccc 81300 

atttaaacct catagctttt cacagaggga gaaactgcag gcttgaagct ggagcaaggt 81360 

tagaggtagg atgcagagtc aggtcggcct ggcatttaag tacggctcct tccattcctc 81420 

ccagaaggag aatggcaaga gcaaaggctt agctgtggga atggcacaag gagttctcgg 81480 

tggccaaagc acatgtcagg ctctgatggt ttaacttctt aaaatgcaat actgcctccc 81540 
agaacttcca gatcaaggtc aaactcctca gctctacaca gggggaccta gagtcaactt 81600

tctaagctag gagagtcatg gatccctttg agaatacaaa agacagtggg cgcggtggca 81660 

gtggctcatg cctgtaatcc caacattttg ggaggctgag gcaggaggat cacttgagcc 81720 

caggagttca agacctgctt ggtcaacata gtgagacccc tatttctaca aaaaattcag 81780 

ctgagcatgg tggcatgtgc ctgtagtctc agttactggg gaggctgaag taggatgatc 81840 

cctgagcctg ggaggtccag gaagctggag tgagccgaca tctcgccact gcactccagc 81900 

ctgggtgaca gagaccctgt ctcaaaaaaa aaaaaaaaaa gaagaaatat gttattgatc 81960 

tactcttgac aaaaatgctt gtgtgaacat ggacacacac actcatcaac attcacattt 82020 

caaggttttc atggaccctt tccatgaggc tctagtggtc catggacccc catggctgga 82080 

acacttgctc ttcctcatct caacccacat ttccatggag ttggactgtc tgctgcatga 82140 

ggacacaggc ctcatttggt gtgttcattc actgctgtgt atcccagcac ccagaacagc 82200 

acctcaccta aggggcactc agcacatgtg cagtgaagag tcagtcagct ggtttcacac 82260 

ctcccagtct ttgcacctgc tattccttct tgtgggaatg acagatttcc ttcatttctt 82320 

tttttttttt ttttgacaga ttccagctct gttgcccgag ttggagtaca gtggcacgat 82380 

ctcagctcac tgcaacctct gcctcccagg ttcaagcaat tctcatgcct cagcctccca 82440 

agtagctggg attacaggtg cacaccacca cctgtgagct gatatttttt tcttttcttt 82500 

tcttttttcc tgagacagag tctcactctg ttgcccaggc tggagtgcag tggcgtgatc 82560 

tcggctcact gcaagctcca cctcccgggt tcaagtgatt ctcctgcctc agcctcccaa 82620 

gtagctgaga ctacaggcgc gcaccaccat gcctggctaa tttttgtatt ttttagtaga 82680 

ggcggggttt caccatattg gacaggctgg tctcgaactc ctgacctcgt gatccgccca 82740 

cgttggcctc ccaaggtgct gagattacag gtgtgagcca ctgcactcgg ccattttttg 82800 

tattttttta gtagagatgg ggtttcacca tgttggccag gctggtcttg aactcttggc 82860

ctcacgtgat ccacccacct tggccaccca aagtgttggg attacaggca tgaaccactg 82920 

cgctcagcct ccttcttcat ttctaatgta ctcatccttc acaactcagc tcaagtttca 82980 

cttctctctg gaagctctac tctaggctgg attcagggcc ttgtccacat acccaccaaa 83040 

tactctgctt acctctatgg aagtccccac actgatctag aataatcagc ttagttttct 83100 

gcccccatcc cgccccatga gatgtacatc ttgtgggggc aggaaccacc acgtggtagg 83160

tgatttgtgt gcctgctgcc tatcacaggg cctggcgcct aataagcttg cggccaacat 83220 

ttgttgaata aatgaaaagg gaatggtggg aaaggaagct gaaaaggtag gctaaaatca 83280

gtttggaatt acctctggga ggccaaggac tttcagtctt gcagggtagg taacaggaaa 83340 

ctcctggatt ttgttttctt ttggttttgt ttgtttttaa tgaagggtag cgttatcgtc 83400 

aggtttttgt gtttaattaa tggagcatat attggaaagg acagagacct taaagcagtt 83460 

aggagaccac cataatagtt cacattttgc agccataaaa aggaatgagg ccaggcatgg 83520 

tggctcactc ctgtaatctt atcacttcgg gaggttgagg caggcggatc acctgaggtc 83580 

aggagtttga gaccagcctc accaacatgg agaaacccca tctctactaa aaatacaaaa 83640 

ttatccaggc gtggtggtac atgcctgtaa tcccagctac tcaggaggct gaggcaggag 83700 

aatagcttga atctgggagg cagaggttgc ggtgagccga gatcgtgcca ttgcattgca 83760 

ggtacatgga tgaagctgga agccatcatc ctcagcaaac taacacagga acagaaaacc 83820 

aaacaccgca tgttctcact cataagtagg agctgaacat tgaaaacaca tggacacaga 83880 

ggggaacatc acacactagg gcccgttggg gagtgggggt tggggggtaa ggggagggaa 83940 

cttagaggac gggacaatag gtgcagcaaa ccaccatgac acacgtatac atatgtgaca 84000 

aacctgcaca ttctgcacat ggatcctgtt ttgttttaag aagaaataaa gaaaaaacca 84060 

agaagaaaca aacaaacaaa aataattccc atttaaaaca ataaaaaata ggccaggcat 84120 

ggtgactcag gtctataatc ccaacacttt gggaggccaa cgcgggcaga tctcttgagc 84180 

ccaggagttc aaggccagcc tgggcaacat ggcaaaaccc tgtctctaca aaaaatataa 84240 

aacaaacaaa caaaatagcc aggagtggtg gtgcatgcct gtcatcccag ctactcaggt 84300 

ggctgaggtg ggagaatcac ttaagcctgg gaggcggagg tagcagtgag ctgagatcgt 84360 

gccactgcac tccacctgga gcaacagagc aagattttgt ctctaaataa ataaataaaa 84420 

taataaaaaa cagagaagag gaaagacacc tgagatatat ttccatatct gaatcaatag 84480 

gatttatcaa cgttctcctc tacccccaaa actaattcct tcctaaactc tgttctcctg 84540 

acactactca taggttaagt ataacagcat tatcacattg gctgtcatgt gggctcctgg 84600 

ctagaggctg cttcacagct taatggacaa gagcactgag acagggtggg tctaaatcct 84660 

ggctctgcag ctgattattt gtgtgatttt gtccaaatca ctccatctca tgagcctcac 84720 

tcttctagtc tgttaagtgc tgaaaataaa agtatccaat tcaattcatt atttaatgaa 84780 

ttatttagcc taacaaatag ctattataaa tatttaggct gggcacagtg gctcacgcct 84840 

gtaatcccag cactttggga ggccaaggtg ggcagatcac ctgagtcagg agtttgagac 84900 

cagcctgacc aacatggtga aaccccgtct ctactaaaaa tacaaaaatt agctgggtgt 84960 

ggtggcatgt gcctgtaatc ccagctactc aggaggctga ggcaggagaa cgcttgaacc 85020 

caggagacag aggctgcagt gagccaagat cgtgccactg cactctagcc tgagcaacag 85080 

agcaagactc tgtctcaaaa aaaaaaaaaa aatctctgca tgaagaatgt acataaaatg 85140 

gtgcagccat ttcggaaaac agtttggcag gtcctcaaat agttaaacat agagttacca 85200 

ctatagccca gcaattccac tcctaaatat actacaccca agagaattga gaatatttgt 85260 

taacacaaaa atgtgtatac aagtatttat agctgtatta ttcattacag ctaaaaagtg 85320 

caaacatccc agcagtccat cagctgatga acggagaaac aaaatgtggt atacccatac 85380 
aatgtcatat tatttggcca taaaaaggaa gtactgatac atgctacaac atggatgaac 85440

cttgataatg ttattctaag tgaaagaaac cagacacaaa agaccacata ttgtatgact 85500 

gcatttatat gaagtgccca gaataggcaa atccacagag acagaaagta gattagtggt 85560 

tgccagagac tggagggagg agataatggg aaatgtggaa tgactgctaa tggtatgggg 85620 

tttcttcttg gggtaatgaa aatgttgtac aattagataa tggtgatcat tgtaaaactt 85680 

tgtgaatata caacatgctg aattttatac tttattatat tttatttttt ttgagacaag 85740 

gtctcgctct gtcacccagg ctggagtgca gtggcacgat ctcagctcac tgcaatctct 85800 

ctgcctccca ggctcaagca atcctcctgc ctcagcctcc tgagtacctg acactacagc 85860 

atgtgctacc atgcctggac aatttttgca tttttagtag agacaggg^:t tcgctatgtt 85920 

gcccaggctg attttgaact cctggactca agtgctccgc ccacctcagc ctcccaaagt 85980 

gctaggatta caggtgtaag ccaccactcc cggcctaaat tgtattcttt aaaagactga 86040 

attgtatggt gtgcgaatta tatctcaatt taaaaaaaac aaaacaaaac aaaaaaaaaa 86100 

cctttgcgtg tgtcaggcac tagggattcg atgctgaata agacacagac cctaccctca 86160 

gagaacacag agcccagcag gagagagtca cagatgaatc aagtgttaca tcatctatag 86220 

gaagcgccat ggaagaaaga catggtgcca tgagaacata cgcttagaga agggaatttc 86280 

atctagactg gggctcaggg aggaatcttt cagggtgatg cttgtgctca gagttttcca 86340 

tgtcagaatc agtagaattt atcaatcctc cagaggagga aacagcaaat gaaaaatctt 86400 

acaacaggag gatgcggaga cattccgaga gctgatcaag ggctggtgtg aacaaagcac 86460 

ataggatgca gagcctgtgg tgtgaggttg cagctggaaa ggtaaaacac taattacatt 86520 

ggatcttctg agacaataaa gagtatgcaa taatctcaaa cgaccgaaac tgaccttcct 86580 

cctccctaac ttgcttgctt ccactgttgc ccgtatcata aaagcaccac cctcttctac 86640 

ccagtggctt aagacacgaa actcaagtca tcccaggctt tctccccacc tcactctcca 86700 

catccagcct atcagcgagc ttgtgggtct taccacgtaa agacttctca tctccagcta 86760 

ctaccatccc ccaagcccag atcaccatca gctcaggcct ggactcctgc aacctttcta 86820 

accgggtctt cccaatccta cccccgcaac atgaccccaa tagcccatca gaatggacta 86880 

atcgagatgt agatttgatc aggccacatc ccttgaaagg cttcctgtga ccctcgggga 86940 

aatgcacaaa ctcccaatga tggcccctga gtcctgtgcc atctgggtct gccctctgcc 87000

ctctgtgtct ttgccatggt aacctccttc acacccatta atactccatg ctctctccta 87060 

cctcaagttc ttcctgggct ggaacattct ctgcactagc ctagccaact aaccctttag 87120 

atcttttgtt tgtttgtttg tttgtttgtt tgtttttgag acagtcttgc tctgttgcca 87180 

ggctggagtg caatggtgca atctatctcg gctcactgca acctctgcct gccgggttca 87240 

agcaattctt ctgccttagc gtcctgagta gctgagacta taggcaccta ccatcacgcc 87300 

cggctaattt ttgtattttc agtggaggtg ggttttcacc atgttggcca ggctggtctc 87360 

gaactcctgg cctcaaatga ccaccctcct cggcctccta aagtgctggg attacaagca 87420 

tgagccactg tgcccaggca acacttcaga tcttaatgat catttccttt aagtgcctga 87480 

cctcttgtag taactagcct gactccagca atgaatcctt ttgcaatgta acctatataa 87540 

catctgagtt tccctttgat aaaactcatc atatatttgt tcctctgaca gttcagaggg 87600 

caagggcctt tgcccacctt cctcaccact atcctctcac cacttaacac agaactcacc 87660 

acccaccatg cctcctgcct gacaaattcc taaccatcct tcaaatctca ctcacctatt 87720 

accttctggg aggcagtctt ccctgagcac caagacaatg ggacacattc ctttatacac 87780 

cctgctgaac atctcttttt tgaggggcgg gtagagatga gtgtctcact atgctgccca 87840 

ggctgacctc aaactcctgg cctcaagcga tcctcctgcc ttggcctccc aaaatgctgg 87900 

gattacaggc atgagccact gtacctgacc gcaactgggt tagnnnnnnn nnnnnnnnnn 87960

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88020

nnnnnnnnnn nnnnnnnnnn nnncctgggc aacaaagacc cttcctctac aaaaaaaaaa 88080

aaaaaaaaaa aaaaaaaaaa aaattatttt aaattagcca gacatggtag tgcatgcctg 88140 

tagtccaagc tacttgggag gctgaaggga gaggatcact tgagcccagg aggttgaggc 88200 

tgcagtgagc cgtgatcgta ccactgtact ccagcttggg caacagagtg agacctcatc 88260 

cctaaaaata aagaagaaaa tatggcaatt tgactgtaca tctctaatgg gatatatcct 88320 

aaggatgaga aaggaataag gaaggacaga aaaaaggaaa caaagaagta gcaacagtat 88380

ttagcaattg tattgttatc aagtaacatc aatattggta aaaccagtaa ttatatttaa 88440 

aatactatat atgtgtatgt acatttacat atgcatatgt taggaaccaa gtttatcaga 88500 

ggaagagaaa gggctacaaa tgtaaaatca aggaaataaa aatttgaata aaaatatcag 88560 

tattaagtat ttatgatatt tttcttataa aaaaattata tatatgttaa ctctatccaa 88620 

aacccaaaag cagtgacaac ccaggagcaa taaaaaacct cagcatccag actgtagtct 88680 

ctaccatttc caattaaaga aacccagggc tagttgggaa aaatgacaat ttcatgtcta 88740 

gggcaagaaa cacacctagt gaaatggacc tgaacattta attgtgttag aaagtaagga 88800 

aactctctag aaataatgtg atttcatcta aaagacacag attctgggct ggtaaagttt 88860 

tcaatggcca aaggtgagac aatttgagca tcaagaagaa tcatgacaga acagattaaa 88920 

acatgtcaaa tatattttaa aatgaaatat tataaaagaa acaattagta gccatccctg 88980 

aaggtcacta gggcaccaac tcatatttca aactggtaaa taaatgtgta agccaagcat 89040 

ttatttctgg gtaacaaaat agtaaggaat gtttttcttt ctagaagaat tctagtgatt 89100 

aaaagtagaa gatagaaata gaaaatcatc cttttggcca ggggcagtgg ctgtaatccc 89160 

agaactttgg gaggccaagg caggtggatc actggagatc aggagtttga gaccagcctg 89220 
gccaacatgg tgaaacccca tctctactaa aaatacaaaa attagccagg tagtgggtgc 89280

ctgtaatgcc agctactcgg gatgctgagg caggagaatc gcttgaacct gggaggcgga 89340 

ggtcgcagta ggctgagatt acgccactgc actccagcct aggcaacaca gtgagactct 89400 

gtctcaaaaa aaaaaaaaga aagaaaagaa aatcatcctt ttgcgatcct aatgaaataa 89460 

tgggcctagg cattgatcat taatggctcc taaaatcact aagtatatgg ttgatgggaa 89520 

actttatagt ggatggatca gactcgcaat gtctaaacca gttgatcaat cttaacatcg 89580

taacaagaca acagacacca ggggctgctg acaggagaac agaggaaacc catagctcta 89640 

ccactgagtt attcacggca aaaaaaaaaa aaaaaaaatt aaactgcgtt tcctccaagc 89700 

ttctaatcct gttgtttaca ggaaataccc aaggaaagga atacttttaa atgacacatt 89760 

aaaacaacgc caaatccaaa atatggggaa atgacccagt ttcttcaaca aataaacaag 89820 

aaaaggtagg ggggaggact gttctagatt ttaaaagcta tagaagacac agcaaccaaa 89880 

tacactgcat ggaccaggca tggatcctaa ttggaacaaa ccaactgtaa aaggatgtat 89940 

ttgaaatgat tggaggaatt tgaacagtga ctgcacagta gatgatatga agaaattatt 90000 

gttatttttt aggtgtgatc atgattttat ggtgatgttt aagtaaaaga ggccttattt 90060 

gttagagata catgtacggg tatagagaaa tatttacgga tgaaatgata cgatgtctga 90120 

gatttgcttt gaaaactcta gcaggtgtgg gagaagcagg tgcatgggtg ggggaaggga 90180 

tagatgaaat aagtatgcaa aatgttagtc tacttttgtc cctcctgacc cagcaggtta 90240 

aaatacctca gcatacctct actcctccaa ccaggtccaa ggatcaggcc aaaactccct 90300 

gatgtggtaa acagcctgac cccttcttac ctctctctct ccagccactt ccctaagatt 90360 

ccccagtgct ctgtgcccta gccagcccga ctcatctgcc cagattcctc aatgtttcac 90420 

tctctcattc accattttga ccccactgtg ctcctgggcc actctccaag gccccgcctc 90480 

ttcatctcct ccctccttac tcatccttca ggtcttggct taggtgccat tccctccagg 90540 

aagccttccc tgacaccaat cccatcctca cctagaacag attatgtgcg cttctttgtg 90600 

ccccccatgg ccccctgtgg gtttgcttca cggattataa ctgcctgact acctgccttt 90660 

ctccaccctc tagactgaga acaccttgag aaaaagaaca catctatctt gtctgtcatt 90720 

gaatccctgg tgtctggcac catggctgac acataactat cactcagtga ctagtgtttt 90780 

aatgaatgaa tgagtgcaac tagacagggt taagaacaaa agagaagacc aggcatggtg 90840 

cttcacgcct gttatcccag cattttggga ggctgaggcg ggcagatcac ccgaggtcag 90900 

gagttcaaga ccagcctgac atggtgaaac cccgtctcta ctaaaaagac aaaaattagt 90960 

gggggatggt ggcacacgcc tgtaatccca gctacttggg aggctgaggc aggagaatct 91020 

cttgaaccca ggaggtgcag gctgcaatga tctgagatca caccactgca ctccagcctg 91080 

ggcaacagag taagactcta tcacaaaaaa aaaaaaaaaa aaaaaaaaaa gagcgagaga 91140 

agatgtcatg gggtaaatga agacctccct tcctggttcc ctgaccagcc cctgccctcc 91200 

cccgcatctc acctgtcttt cttgcttcct tctggtactt ctgtttaagc cggtccatga 91260 

gcaggccatt ccagggggca cacagcactc cgaactgagt gaaggcaaag gcatttgtgt 91320 

aggtgctgac tgcagaagga agagagaggt tggttgatga gaagtttcca aaactccctt 91380 

ccaggcaggg actctcccac cttacccttg tctgcatgtc cctcctcccc acaccatcag 91440 

accctcctct ggtgtgtaca gccctgctgg gaggctctgt gttcccagct gggacatgca 91500 

gatgggctac ctcccagccc taccacatac ctcgtgccat gtccccaccg gccatgttgg 91560 

tcagcaagga gttgagagtg ccaatgaaga ggtagtgcca caactgtatc acagacagcc 91620 

acaccaggtg ccaggcaaag cgccgagaga aagcgtagct ccagaaggag cggagttcct 91680 

gcttctgccc tgcccctggg gtctctaatg gggagaggag gatctgggcg tgaattacga 91740 

ggaaagtgga caggtaggat ggggagtgtg gaggcttcaa tggaacattt cagatccggg 91800 

cccaccttct acccttggct cccagaactc acccggcgtg ctctgcacct gcctcggact 91860 

cacacatccc cacctccctg ctttgtcatg ctggccctac caccttggat gaccctctgt 91920 

gttcttctct attgaaatcc gatctgtctc tcacagcctg gtcaatgaca cttcttgcag 91980 

taataccttc ctgatctttc tcagcgagaa aggtgaaagg aacgacaagg agaggagaaa 92040 

gtcagaaggg agaggagaat gagtgtggat actctgttct aacctgctcc tcagcacctc 92100 

cctttctttt gataccagta tcctgagttt ctttgggaaa tcttcctcta ccctaatctt 92160 

catggtccag atgggaccat gaattcagtg ttctgttcct ctctcaaggt taaccaatga 92220 

gatggttcct cctaacaagg cagaccggcc atgagtttag attaggatgg acttaatcta 92280 

aaataggtcc tacaccctgg caagttcaat gtcctccctt gattttggaa gcttcccaga 92340 

accctattct tctttcttta aaataaataa ataaatacat gttttggatc caattgtcag 92400 

atggtaaaaa taaaaacaaa aaaatcaatt ttattctgta tatttaagat atacaatatg 92460 

aggtcatagg atacatatag ctactaagat ggttactaca gttaagcaaa ttaacatatc 92520 

catcatctca catttctacc tgttttgtga caagagcagt taaaatctac ttgtttagga 92580 

aagtcccaaa cacaatgcag tttgatgacc tacagtcttc gtgctgtgca ttagatctct 92640 

aggcctgttc atcctgctca tctgctcctt tttgtccttc gacctgcatc tcccatctcc 92700 

tctcccaccc cgttcttatt tctactgtag ctagctgcgg tttgtgatgt gtgtaaccaa 92760 

agacgcagaa cagagaggaa ggaaaggaag cagtgataga gttgggacaa taagagaggg 92820 

cggacccagg agacctggag aaatgggggc actgtaccag acttagtgca atggcatcac 92880 

agaagagggc agaaccgagg agtgggggga agggaaggca acccatggca ggcgggcttc 92940 

aaggggtggg gaagtgatag gatgcgaaat agagaaaaga gggacagaaa agagacgaaa 93000 

gccctggacc ctccattaag tgagagggtt gggaagatgc ctaaggccct ttttctgtcc 93060 
tgcctttcct gattctgggt ccctggggga gctctggagg tgaggggcca ggaaaggcac 93120

aaggagaggc ttgggtctgg aggagagatg ggttagccag cagggctcac cttccttcgc 93180 

tgaaaggaac tcctttgact gtagctccct gttttcatgc tcagctgttt ccttctcttc 93240 

ctttgtggtg ccattcccag ggcacaggct atggaaacaa aagccccacc agcaaggcca 93300 

aggactgtga gccgaacctg agactcagac tggagggaat agcatggtga atcccacatt 93360 

ccaccgcact ttggaatcac cttttagcca ctctgatgcc caggttgcag accagaccag 93420 

ttaaatcaga atgtctggag gtgagagcca ggcttccttt tctaagatct ctatgtgaat 93480 

ctagtgattc taataagcag caaagtttag gaagcatgaa aagagtaggg caggcccagg 93540 

ttcaaatccc agctctgcct cttcctagca acagaaagat ggctcagact taacccttct 93600 

gagcctcatt ttttgcattt agaaaatgga gataaggata tctcagagga ttattgtgag 93660 

gatgaaatca gagagcacat ggggtctgac aattagtaag tgagcagcaa aggaatgccc 93720 

ttcctctact ccttgtggca aatgactgca aaaatgatca catttcttca cctcctctgt 93780 

atttccccca atttgaatga gactgcagct ctatttcccc atgccctgaa tctgggccag 93840 

ccttgtgaac tgcttcagcc aaaagaatgc agcagaagtg gctgtgccaa ttccaagctt 93900 

aaatctcaag aacgcttgtg catttctgca ctctttcaga accctgaaat cacggtgtga 93960 

atgagcccac gctggcttgc tggaggatga cagccacgtg acccaggcat ccctgtcact 94020 

ccaaacctat gtgagtgagg ccatcctagc atagccagcc cccatgtaat cctccaaatg 94080 

atcagatgta tgaatgagcc ctgtcaaaat catctacatc tggccctgat cagcggaact 94140 

agccagctac ccacagactt gtgaaaaata ataaatgctt aacattttag gctgctgagt 94200 

tttgagatag tttgttatgc agcaatagct aacagatgca ctgctccagt cctcctcctc 94260 

tcctgtgata ggtttgcttt accctgtcca tcccacccta gggccaatga ggggctctgg 94320 

cccacaatca ccagatagtc cttacccata gctgtagttg gggggcagtg ggtatgggat 94380 

gtgcccccgg ggcatcagga ggaaagtgcg tgctacatgc caggtactgc agacagagat 94440 

gaagatgaag gaggccctga ggctgatgcc tttttcataa agaagctgca gaaggagaag 94500 

gaaaaagtca gtgtcacacc cacgttcata gcagcactat tcacaatagc caaaggatgg 94560 

aagcaaacta agggtccatc agcagatgaa cagctaaaca taatgtgatc tatacacaca 94620 

atggaatatt attcagcctt aaaaaaggaa agaggcaacc atgctggctc acacctacaa 94680

tcccagcact ttgggacgca cgaggatcac ctgagcccag ttcaagacca gccttgacaa 94740 

catagtgaga ccctcacccc ttctctagaa aatttttatt taattagctg ggtgtggtgg 94800

catacacctg tagtgccagc tactcaggag gctgagtggg aggatttctt gagcccagaa 94860 

gtttgaggct gcagtgagtc atgactgggc cactgcaccc cagcctggac aatgaaacat 94920 

gaccttgcct ccaaataaaa aaaaaaaagg aaaggaaaga aattctgaca catgctgcaa 94980 

catggatgaa ctttaagagc actatgcagg gccaagctca gtggttcctg cctgtaattc 95040 

tagtgcttta gaagaccaag acaggaggat tgcttgagtc caggagcttg agaccagcct 95100 

gggaaacagc aagacctcat ctctactaaa aataaataaa taaatcagct gggcgtgatg 95160 

gtgcacgcct gtaattccag ctacttggga ggctgaggtg agaggatatg attacatgat 95220 

tacatgcctg taatcccagt actttgggag gctgaggcaa gcagatcacc tgaggccagg 95280 

agttccagac cagcctggcc aacatggtga aaccccgtct ctactgaaaa tacaaaaatt 95340 

agtggggcat ggtggcacgc acctgtaatc ccagctactc gggtggttaa ggcaggagaa 95400 

tcgcttgaac ccgggaggcg gaggttgcag tgagccaaga tcctgccacc gcactccagc 95460 

ctgggcaaca gagcgagact ctgtctcaaa aaaaaaaaaa ggttaagata gtaaatttta 95520 

tgttatgtat attttattgc atacaaaaac atcagcagaa gaggcagggg ctggaaccct 95580 

gttttctaag gagtcctagt acaagccatc acctactatc ctgtaagctg attagggaca 95640 

cctggtacac acatgccccc acccacccca agacacaccc ggcagtagag gagtcctcat 95700 

acgacccatc cccacagccg gtggagcctc ctcgtgtggc tccccagaga tcttctagcc 95760 

cagtgccttt tttcccccaa cgacagcaaa ggccttttgt tcaaagaaaa ttttacacaa 95820 

aaattcatct tacaaaacac accaatgggg agcttgccag tcatctccct ctttattctc 95880 

cttggtgact ggtatgacat caaagagaat ccctaagttc ctcaacagct cagtttgaaa 95940 

accaccgacc tagcccaacc tcctcccatt ttacagagag tgacgttgag gtccagagag 96000 

gtgcagtgaa ttgctcaata aattgacaga gtaagcagca gcaaagtcag attaaactaa 96060 

gaattcctgt tcctgctccc tttccccttc caactctaga gagacaggag agaggctggg 96120 

catggtggct catgcctgta atttcagcac tttgggaggt caaggaaggc ggattacttg 96180 

aggtcaagag ttcaagacca gcctggccaa catggcgaaa ccccatctct actaaaaata 96240 

caaaaataag ctgactgtgg tggcacgcct atagtcccag ctactcagga ggctgaggca 96300 

ggagaattgc ttaaacccac taggcagaga ttgcagtgag ccaagatccc accaatgcac 96360 

tccagcctgg gagacaaagt gagactccaa ctcaataata aaaaaaaaaa aaaaagagag 96420 

aggaaagaaa gatgaggcag ccatctgggt tctccagggg aaggagggag aacccagaaa 96480 

gtgactctta tgccaggagt agaaaggctt gagtgcctca ggggctcagt ctctgcataa 96540 

ccctccaaac ctccaaagct tatgggacta agctagactc atgtctgggt ggtgactgcc 96600 

agagatcctc ttctctgccc ccataacctg caggcagtgc caactgcctg tgacctaaca 96660 

ctaagcccag agagaagtcc caggttggat ggcttgagat ccacactctt cccttccttt 96720 

cactcagcca tctgtggtgt gctggcttta gtcctccagc ttgctgcctc ataattgaag 96780 

catggttgcc acaactccag ctatcacatc ctcacaccac aacattcaat gaggaagact 96840 

ttgtttttac tctgctttca ccttgcgtca gggaagaaaa gtccccttga atcttccact 96900 
atatacactc cctttatctc attaaaaagg actggatcat atgctgacct ccacctatca 96960

ctagcaacgg gtaaatggat tgccatggtt ggctttaatc aatcaggatt catcccctgg 97020 

gctaagcggg tcactgccca gataaaactg ttcgcaatga ataagacaga atggttgttg 97080 

attgacctct aatagccttg gcaacagttc atcccctgat accccaacat cagccactgg 97140 

gacagctgga caagcctctg tgtctgcccc tgctgtaccc actagccact tgccaccttc 97200 

ttgtccaaac tagaagctca cagcagcaaa cgccccactc taaaggtccc ccagcctcta 97260 

cccaacactg gcccaagcac attatgacca ctgccacaaa agcttgggca agtctgaaga 97320 

aggggcttag cggttacaag ctcaggctct agaaccgaca agcctgggtt caagttccag 97380 

tatcatggct actagctgca gaaccttcaa caagcttttt aacctcagag actcaaatgc 97440 

ttcatctgta aaatgggggt aacacagtac ctacctcacc gagttgatgg agacaaataa 97500 

tgcaggttca caagacaagt gtctggcata tacaagtgcc cagtgaatgt aggctgttgc 97560 

tatatttacc ttaataataa ggaagactgc cgaggaagag tcaaatgctc cattgtacag 97620 

agtgatgatg gtcgaacggt gttggccaaa taggttccca atctggggat gataggacta 97680 

gcctggatca cttatttatt catgaaacag atacttcctg agcacccagc atgtggcaga 97740 

ccctccttat acccaaactc accctccacc gctagagctc ccacctcagc ttgggccaac 97800 

cccatctgag gcagccaatt atagaaaagg gtctctcctt ccctccacct tcccgccacc 97860 

ctgccgagtg cctgggatta gggaaggctc ccacctgcag gttggtgatg agaaacagga 97920 

ttcccccaat ggtgagcatt ggcatggcca ggaagagcag cacggctgag cctggaccat 97980

caaagtcaga ggtagggtgg tgtcatagtg caacccaaac atggagcccc aaactctgct 98040 

cccacctgct ccaaattccc aacaatcctg gtatccaggc cccaattcta gccagcgttc 98100 

cagcgtcctt caagggtttt taggataccg gccaaggctt ccccagatat ctctgtggaa 98160 

gtcttctgag ccaccttctt cccaaccaaa gttggtcctc agtctgtggc aggccaggaa 98220 

gtctacagac agaggcagag ctctaagtga agccacctct ctcttccctc agtaaaccac 98280 

aagctgcctc tccctttcat ccttgacact cctggaaaag aagaccctgg actcaggtcc 98340 

ctggctcaac cctctagccc attccctaat tcatggtatt ggccttgagc ttcaatcatc 98400 

tgtttaatgg gaacaacagt tcctgctctt cctgtctcag gtgctatgag aactgagtga 98460 

gaaaaggacc atggtctttt ctttgttcac taaactctga gcacttcttt ggtgccaggc 98520

attgtgcttg gcactggaaa tgcaagatga atcagatagt ccttgccctt aaatagactg 98580 

acatgcaaac aaatggttat aacaggtctg gtaagtgtga gaccacagca aaaaagctca 98640 

agagctgggc taggggaacc cttgacaaat tcttcctccc caaaccagac ttctgcccac 98700 

cattattctg gccacaacct atgcctgtcc tattatttgc taaaatgttt taagttgact 98760 

cacttttatc caaaaagtat ctatttttaa aggacacttt atatcactac tgtagatgaa 98820 

aacactggca ttacttgtca tgaatagaaa gtaactgtca aaataaatac aatgaaagga 98880 

aaacaatgtt attcaattgt agctggatgc atttgacctt agaatgttca aagcctaaga 98940 

cctgctcttc ccatcagtgt taaaatcaca ctggccccac atgaagacat tctttcatga 99000 

aatcagaagg actgaaagag aaataaaaag ggaatagctg ttctaccagg tgatttgatg 99060 

tttgttagtg tagttcacgt agtatgcgtg tgcccctaac atcctcttaa ctaccgtgct 99120 

ataccttaag aagcactgcc aagagctaat tttagagtat tcacacagtt taccattcaa 99180 

tttctgtctt tataaaatgt acatctctcc tactactaaa ggttggagac tcctttcaca 99240 

atagagtcct tatgggctca atgctttttt caaaactgaa aagccctata ttatggagga 99300 

agaggaggat tgttgctcag acgatttgca ggcacgagtc aaacattacc cagccaccac 99360 

ctccacattc agttgcttaa aaatcattta caggctttta gagtagatga tgctggtttg 99420 

ataaggagag tggtttgaaa taattggttt gaggtgctgg gccatctcat gagatctgtg 99480 

tgaacaaaga cactcagcct ctgtgtttgc ccagcatgag tgcagacaat ctcatgatgc 99540 

tgtcagcttt agcatagctt acacacacaa gagtaatgta ctttctttcc taaaccaaaa 99600 

attgagccac gggtctaaca ctaggaagga atattgggag gcatctcgtg gccaccataa 99660 

ccaaggcaat gacagaaaga agagtgaggg atcaggaggc ctgcacatca ggcccacctc 99720 

ccacttgctt tctctgtggc catggacatg tctttgcaag gggtcctgct gtggcttcag 99780 

tttctcctct gtgtaatggg tggaagggtg gtggaaaata aaccagattg gagttccaga 99840 

cttaacagac tggtgaaatt ttaaaacaaa gattttgagt acaatagggt tgtcaacttt 99900 

taccctgcta agtaaggata tttgcaaaat ggtcattcat ataatcattt cattaaaaag 99960 

agaaagagaa cattttaaca cataggagaa ggatgtaaag gttttttgtt gttgttttgt 100020 

tttttagggt tttttgtttg ttttttggta gagtctcact ttgtcaccca ggctggagtg 100080 

caatggtgtg atcttggctc actgcaacct ctgcctcctg ggttcaggca attctcctgc 100140 

ctcagcctcc tgagtggctg ggattacagg cgcgcaccac catccagcta atttttgtat 100200 

tttttttagt agagaaggga tttcaccatg ttggccaggc tggtcttgaa ctcctgacct 100260 

cagatgatcc acccgcctca gcctcccaaa gtgctgggat tacaggagtg agccactgca 100320 

cctggccaga tgtaaagttt tgaataaatt ctactctctg aagtaatccc tctccatcat 100380 

ccttgctttt cacattttct caataaactg ttttcacaga ccagcaatag ctcaagatcc 100440 

ttccaggatt ctttcaagct gcagatctct gaataaccat gtggtctgta tatcttgcct 100500 

atagccctct gctcacacct gccccagcca ccaggtgcct ctgagcttgc atccctccca 100560 

cccacctgac agcactcacc tgcagaggtg aaggctatga tgagtgtggc ggtggtgtag 100620 

aaaaatctga aacacataac aggaaaagca gaatattgtc aaggagggag aaacctggga 100680 

gaaaaaacat gattctgctc agccagccca caagtgtagg acttgaccgc accctcagcc 100740 
tgggatgcaa

gcgatgagat 

ccaaggctag 

acttagccag 

aaccccttgg 

ctcctcactc 

tataaaaaga 

ccaaggttat 

aaagcaatat 

gtagcaggga 

aaattgtaag 

cccgtctctc 

ctgtatcagc 

aggatgcagc 

ggaaaccccc 

ccaaagaaac 

gggggcggtg 

gatgaggaca 

ttgtaaccct 

agcacgtgat 

ggggctgtgg 

agctgctggg 

ggatttcact 

ctggcctttg 

atgtagccag 

gagaacctct 

agccttcctg 

tcgcaaactt 

ctggtgcctc 

tctcatcata 

tgggagccag 

gcattgccaa 

aagacaaaca 

agcccagtca 

ggagtggatc 

gcgtttgagc 

tggttgtttc 

aagatagagg 

tgtgaggtgc 

tactccaata 

ggtcttgatg 

gagcaggtgg 

gacagcaact 

ctgccaagtt 

aggaacttgc 

aaataccccc 

tcctcgcatc 

cccgctccag 

ggcgccccat 

tctccgcgcc 

tcccctgctc 

tctccatcac 

ccctcttgaa 

cctcgctgcc 

aatgctgtta 

cacagacagc 

cgctgggttg 

gtttcacact 

aggactgagg 

aaacccctta 

gccagactgc 

attgctcatc 

tccagcccag 

aagctcagtt 



cgggcactga 

tgagatgtgg 

catggaaagt 

gacagaatgg 

agaaaggggt 

tgtgaccttg 

aatgatcgga 

caggctccct 

cctaaggctg 

gccaaggacc 

ggactggctt 

tctctgacag 

ccccagcagg 

aacacactgt 

ccagtctccc 

ttcctattag 

aggaggatga 

aggatccttg 

ggctccaagc 

agggaagcca 

tgaagtgggc 

agcgagctgc 

ccaagctagg 

ggacttacat 

tggggaatgt 

catcctgggc 

ccaaatgagc 

gtcccttaag 

cctcccaggc 

gaaagagggg 

acttaacttc 

tcggcccagc 

ctagtgaagg 

gcagtgtggc 

ttcaaatccc 

acttggaaaa 

aggcctgggc 

ctgctggaag 

attaagaggc 

catcatctta 

gccacctgga 

agaagcggag 

caaccaggtg 

ggtgttatct 

ccaaggtcac 

aactcccatg 

aacttctacc 

cctgcggcct 

ggcgcccgaa 

ctttccgcac 

gcctccgccc 

ttccttcttc 

acttctccct 

tttcagcctc 

ctcgctcagg 

acatgagggc 

gcagggaaca 

cccagagctg 

gcttttcaga 

gatgactccc 

attgattaac 

ggagcttgtg 

agcacgtgag 

ggggctgagc



tgcctctgag 

tcctgacctc 

cttggacaat 

gcaaagcgaa 

tagtgcctgg 

ggcaaggcgc 

ctgatctagc 

tagcatttgg 

gagaagaggg 

aagaaggact 

tactctggct 

ccaatctcac 

ccgctgatta 

gtgtgagatc 

ctccagtcag 

ctaagcccct 

ggatgaagcc 

gggtgaagga 

cagcccttac 

catttatgca 

aagtctagag 

aggtgtggtg 

ctgctgcctg 

ggctatgagg 

catgaagttg 

tttgcagtct 

acacaggctg 

ctgagttgaa 

acttcccaga 

tgggcagggg 

cttagaaaag 

atctggtcca 

ccagccaaag 

cacgtgcagg 

actttgtcct 

ttcctctggc 

aaaaaccatc 

aggaggcctg 

aggatgatca 

ggcactgcat 

gtcacaggtg 

ctcctgcgct 

cagaggcagg 

tcagtttact 

agggctagtt 

accctcgcct 

aggaaagcct 

gccctccccg 

aggggaccct 

gggccaggtt 

cctcctgccc 

ttttcccttg 

ctagaacccc 

ccagccccct 

cccacctagg 

atagggacac 

tatttttcct 

ccaaaaaaat 

aaccatcccc 

tgcctcaccc 

cattcattgc 

atcagcgtga 

ctgacgtctt 

ttcttgcgta 



ccccaggctc 

atggacagtg 

gagctcagct 

ggtcctccca 

tctgcaaatc 

tgtccttctc 

tcagagtgct 

gggtcttgta 

aggggacaaa 

gaggtacagt 

gtttccggaa 

atgtgcctcc 

ccactgagcc 

acgtcccgaa 

aagggacctg 

agggagtgat 

tgggcaacct 

agagaagagc 

aggaagtcac 

gagcagggaa 

agagtcccgc 

cctggcaggg 

aaggattcct 

cgtgccacgg 

ttcatgaagg 

ggagtagaaa 

ggctcccctc

atgtggctgc 

tcaagtgggg

gcagagtcct 

tcatccctgc 

cacagatcct 

aggacgccag 

ggcaggccct 

cctggacgga 

aagccaagcc 

agcgggtgat 

cgggaaaggg 

tggccgctgg 

ttgagtaacc 

agataaacga 

cgaattctga 

gagatttcat 

gacaatgaaa 

tagtcacgac 

aggactcgca 

ccgggggccg 

cagaggagcc 

tcgccctacc 

cgcattcgcg 

gcccggaagg 

ctcacagcct 

ctagaacccc 

tgcctctgcc 

gaaaatgtca 

acacactcta 

atttgctcac 

cccaaccaca 

tggaggactg 

ccgcccccca 

cccatttttt 

ggccttacta 

ctttggcctg 

ttagatgcag 



aaaaccaggc 

cattttgctc 

gacgatgtga 

cctggaagcc 

aaggccttga 

tgggcctcag 

atgatttcag 

aggcatggag 

ggaaggggag 

cattctgcat 

aggcaggccc 

ctgggagcac 

tggccacaga 

ccttctgact 

aaattccacc 

tggctgttgg 

ggatgtgagg 

aattttaggt 

cctggcctcc 

cgaggtaagg 

tgcctggggc 

tggccgggct 

cttacccacc 

tggtcttgaa 

accccagggt 

aaaggtctcc 

cacctcagac 

ccctaattac 

tgagagctgc 

tcctgctcct 

ccttaccagc 

taaagtaatc 

caaagcccag 

ggcccgccat 

tcacaggcgc 

cttcctttcc 

tctctggatc 

aaaggtagac 

cagcaaatgt 

acgtggcaag 

cttacccaat 

ataccgtccc 

acagaagaca 

caaaagctcc 

gcaggccatt 

aacctggtcc 

ctccccgcca 

cgaggggcca 

cgcctgctcc 

cctctcgcag 

ggctggggca 

cccgcgccct 

agcggtgtct 

tcccctaacc 

cacccagcac 

tttgtgcatt 

cagcttaacc 

gaatcaggaa 

ccccatattt 

ggttctgaaa 

attaatcaaa 

agcagctgcc 

tgtggccgtt 

tctgcagact 



gcaagaagcc 

attctgaggc 

ttggcttggg 

ccaacagccc 

gttctaactc 

gagccttttc 

gactacagtc 

taaaaaaaaa 

gaaaggggag 

ccaaaggctt 

agccagccct 

ctgctctgag 

gcacgagatt 

catctgcaca 

agtggcaata 

ggcggggagg 

ctgtgcaggg 

tttgctaatt 

ggctcaattc 

aaatggaagt 

tgttcctaac 

gtctgactct 

tttgcctggg 

ccggtcaaag 

gaagatgagt 

catgcatccc 

agcttgtcgg 

ccctcaggag 

tgacccttcc 

tgccaccacg 

ctgccctgtg 

ttcattcttg 

gcattccagc 

gagcagaagt 

cgtaagcctg 

cgtagctctc 

ctgtagaata 

tagagttatt 

ggggaataaa 

tagagaggca 

agctccccgg 

ctaaaataat 

caaactcccg 

catggatttc 

ctactgccag 

ccgccgccct 

gcctccgcac 

ggccgcgctc 

gcgccggggc 

cccctcccag 

gacctcccac 

ttttacctct 

ttccctccct 

aagttagttg 

ccagaggaca 

ttgccttgac 

gtctctccca 

gccaagaacc 

tcactcccaa 

gagccttccc 

gacatatata 

ttactatcct 

tccttgccaa 

cccaacccca 
gctacctgga
tttaagagga 
caccaccaaa 
gaagggtggc 
tctttggctg 
aaaagaagta 
gtagggtagt 
ggaggaattt 
tctgaaacgt 
gtcttttctt 
acaaacaggg 
ttaatgcatg 
gtgtggaagt 
gttaatctgc 
tgaagcacgg 
cttgactctt 
cactaacaca 
cccatgttct 
tcctgcgtgg 
ggtctgacgg 
acaccttttt 
tgtgccaagt 
ctcttgtggg 
aggcagatgc 
ggaaccagct 
tctaagtgag 
cccttaatat 
cacagaggca 
gttcccactg 
tggtaaggac 
agggagcata 
tttccttccc 
gacagagtgg 
tataagtgag 
ctcaagagca 
tcctctgaac 
tcctaccctc 
ataaagggct 
tcacaatggg 
ttctacagcc 
cagagaaaat 
agccaattac 
tatggattaa 
tactgttgtt 
tagggagaat 
aagtaatggg 
tagatgacac 
gcttggattg 
gatggagtct 
gagggtgggg 
tttctcttta 
cactctggcc 
cagcctggat 
actggaagca 
cttatgaggc 
caactgttta 
cactgacaat 
ggaggcacgt 
tgagcactga 
cgataggtgt 
aactccccca 
ctgaaactaa 
tcaccaaggc 
tcaagtgatt 



tcccctgagg 
agaaatagag 
gcacaaaaga 
tggggtcctg 
ccccacctgt 
taagatgggt 
agattttcta 
caaaggaact 
atgctctcat 
aaaataacat 
cttaagatgc 
ccctatcttc 
agcaagggag 
aggagcctgc 
gacacggggc 
agagcctcca 
agtcttactg 
ttggttcgaa 
ctccaatgag 
tgcacccgat 
ttttttaatt 
gccgggattc 
agagacagac 
tagcacagca 
agagcacagg 
ggcattaaga 
ctctcctatc 
tcatctgctt 
gatggcttca 
cagtccaggg 
gaactggatc 
atcatcagca 
gaagatggat 
ctgcactcca 
ccaccttcca 
aacctcagcg 
tgacatcact 
ttgatttcgg 
ctcatgtaaa 
tgttttggaa 
tctaatcctc 
tggtatctct 
agataatgta 
aacaaatatt 
ttcaaagtgt 
tgagctcagt 
ccaaatgcac 
tcctgtaagc 
tagagctggc 
aactactaca 
acagaggtga 
cctcagagca 
cagcctccct 
ctgggccagg 
tactgctacc 
ctggtaactg 
atctaaacct 
cctacccctg 
ctatgtatct 
tgttattagc 
acatcaccag 
tgcttttttt 
tggagtgaaa 
ctcctgcctc 



gcccaggaac 

gttacgatag 

taaaagtctg 

taccccaagc 

aagacgagga 

gtatgaggtc 

gtctgcagtt 

tgcccacggt 

caccccgtct 

tatttatggg 

aagattttct 

tgctctggca 

aaacagaaaa 

cagaaagatc 

aggtcgtcca 

ctccacatct 

aactgatggg 

ttccagcagc 

gattaatgag 

gtcttcatcc 

gactgattgg 

gtagagaaga 

aaagaaaccc 

cattgaaagg 

gtggtgaaga 

attcagccca 

tatccgtggc 

ccaagccatc 

cttaactgct 

tctggggatg 

tcctgcatcc 

agtctggata 

atgtttaggt 

tcattcccct 

atgcagtccc 

agggcaattg 

ttctaaattt 

atacgccaaa 

ttaaaatgtt 

gtcagaaagc 

ttgaggcagt 

gagcttccgt 

tgaaaactgc 

attgacttac 

tcctaaccta 

ttaccttgct 

tcagtacaac 

tgtgggaagt 

tcttcagggt 

tagccaagta 

atctgaaagg 

gccagctcag 

ttcaccatgc 

cctaagagaa 

caccctccca 

cacttcccga 

gcaactctaa 

agaagaaaaa 

gattcgaagc 

atctattatc 

gtagcagtgt 

tttttttttt 

tggtgtgatc 

agcctcccga 



tccagctatt 
gggacagcca 
caaccaccct 
tactcactag 
caatagtacc 
cctgcatggc 
atgtagacag 
cattctacaa 
taacaaacat 
agaaaatcca 
attttactgc 
tacattttag 
aagtcaaagt 
tagctcatgg 
gggttctgtc 
cccatcaatg 
acaggaaatt 
tagaaaaggc 
taacatcaga 
ttttctcttc 
ttcaacaaat 
gattcagtgc 
aagatttctg 
cacggaacct 
aagtggaaag 
catcaatcaa 
cactgctcta
gccattctcc 
caaaaccctt 
ccagccatga 
tactgcccaa 
tccaggatcc 
tagggaaaga 
ggcacaaaca 
tgcctgtcca 
ccactctctg 
gtatatgtag 
cacataaaca 
tggttttcca 
aaaaggtaaa 
gccaaaaata 
ctcctcatct 
ccaattagta 
tggaaaacaa 
aaaaagaaat 
taagaagtcc 
cccccaagac 
ctgttcatga 
atctcctagt 
aatatgaggc 
atacccaaag 
cttcagtgga 
agaaaacaga 
agcagagccc 
agccccagct 
tagactttgc 
ggactaggct 
taaggaatat 
gctttttgct 
tgggccaact 
caggactggg 
tttttttttt 
tcagttcact 
gtagctggga 



ccaagcccac 

gaactgagga 

agtgacttga 

ttatacaacc 

ttaattatag 

gcaggtgcta 

agccagagaa 

agctgcagta 

ttggacatta 

caaaaata^a 

aagacacaaa 

tctcctgggg 

aaagagacag 

gctatctgta 

caccttatct 

tctgcagaag 

agaatatcct 

agatgctatt 

gagagaagtg 

gcctccttcc 

acatgtggta 

ctgctctcaa 

gagtgtggga 

caacaaaaca 

atttgaggcc 

tcatgtccta 

tgcagacact 

tgcaagagtt 

ctgaggtcca 

gacattgctt 

gtaccaatgc 

accctatgga 

gggtttgcca 

atggctagta 

cagacctctc 

ggcagagtcc 

ataaactctg 

aacaaacata 

cctacggtct 

tgcaaacatc 

atacaagcac 

ataaaattgg 

cactattaat 

agaggaaata 

aaattagggg 

caccctagag 

tgtctgggct 

gccacagtag 

gtgttcaaag 

ctccttgctc 

aggcactgga 

tgctagaggc 

gctcccccac 

caagccccac 

ccacttctat 

tagaaaggaa 

ggggaacact 

caataatacc 

taatctgtca 

gaggcctaga 

atagaaacct 

tgagacagca 

gcagcctcca 

ttacaggcgt 



tcctcttttt 

ttttccagct 

ctgaatggag 

tgaggcaagc 

gaattgtcat 

taggcagatt 

gcagctctgg 

ccttcccaac 

gagaaaacaa 

gcatcccagg 

gactctgaaa 

ggatcagtaa 

attttagaat 

catccaggac 

tgttacctct 

acgtggcctc 

ctgaaccatt 

ctgatcactc 

attataataa 

tcatcatctc 

cctcaggctc 

ggggctcatt 

atggtcttcc 

ataacattta 

agcgtcgcca 

ttgatttcac 

catcatctct 

tatttccatg 

gtcaactggc 

tgaggggaag 

tggaggtggt 

tgtttttatg 

aagagggcag 

tcctctagtc 

tcctcaaact 

agatattctc 

agccattcac 

agctttcctt 

tgaaaggggt 

atttcacctg 

actgctatcg 

aatcgagctg 

aaatagcagc 

aagtcacatt 

gaaaaccact 

aactgatctc 

taaggcaggg 

acaggaaggg 

cagttctcag 

tggggagacc 

gggtgggggc 

agcagaggat 

cagaccagat 

cacaccaggg 

gttcatcaag 

tgcctcagtg 

gtgtcaacat 

tgctgggcac 

ctgaatctca 

gggacgaagc 

gatgctctga 

tctcactctg 

cttcccaggt 

gcaccaccat 
gcccagctaa
gtctcaaact 
caggcgtgag 
cctgggaggc 
accagttatt 
aacaagagag 
accacaccca 
actcctgagt 
tgaaaagagc 
caaagggaag 
tagggattag 
gagagcacat 
caagtctgtg 
ggaagtggga 
gcacctttct 
tagccctgac 
gtcgtggggt 
atggtgagtc 
aaaccacgga 
tgggctgcaa 
aaagcattct 
acaaaattgc 
ttctcaatga 
tgagtgctgc 
tgatgcccac 
tgaagaaact 
ggaagagata 
cttttcctgt 
acagaagtcc 
gacatctctg 
ttgttgtgat 
gataaacctg 
gaatcctaag 
gaaatcaata 
ttgacacata 
atatccaagg 
gatcctatga 
ttccttgttt 
tgagacccac 
tcacttctgt 
aaaacatgga 
ctctgcaacc 
tagcacccac 
gaacagtgtc 
nnnnnnnnnn 
nnnnnnnnnc
ggtctcaaac 
acaggcatga 
cgggaagtgc 
ggattctcca 
aggacagtct 
accagagtcc 
ttccagagat 
cccacaagct 
taacttggta 
gctcacatct 
gagttcgaaa 
ttagccaggt 
aattgcttga 
gggtaggcaa 
ctttattctt 
tgataatatt 
ctatccatct 
aatgagaatt 



tttttttgca 
cctgggctca 
ccaccatgcc 
ttcccaagag 
actaatagga 
tagtgctgcc 
cccacacgca 
gaaaacaacg 
aggaaactgc 
ccggaagaga 
ctggagggag 
gtaggatggg 
gtgaaattag 
gacccagggt 
ctcttggggc 
ttgctgaaat 
gctgcaatgt 
cgccctggga 
agttcccggg 
aaccgcaaga 
gccctccagg 
ctcaggggca 
aagcaccctt 
ttttgggaca 
agccacccca 
gaggcctcaa 
gacttcccag 
gccaatgtga 
tctctgtgtc 
ttatactact 
taagtatagg 
aaatgcaatt 
gctgactggt 
ccagaaggtt 
ggtgaagctt 
cagggagcca 
agatcttacc 
gcttagatta 
tgaagaggaa 
gttgtcactg 
cttggaagcc 
tcgggcagat 
ctcaacatct 
tgccatctgg 
nnnnnnnnnn 
taatttttgt 
tcccgacctc 
gccaccacgc 
ctgtggggct 
tgacctctgt 
gatggctggg 
ttccgataca 
atccatgcat 
tttgacatca 
tctctggtac 
gtaatcccag 
ccagcctggc 
gtggtggcag 
acctgggagg 
caaagcgaga 
tttgatggct 
taagttatgt 
gtaaatgtct 
tctgggtcaa 



ttttttgtag 
agtgttctgc 
cagccagctg 
ctacacagag 
atgaagatgg 
tctctgtgat 
tgatcatgaa 
gggtttttca 
tggggccgat 
ctggaacagt 
aaggagaagc 
aaccccagat 
ggaaggctcc 
ccagtcctgg 
ctccattcct 
gctgggactc 
tcctccagat 
gcctctgctt 
gctgatgggt 
tgggctgtga 
tcttaaagca 
tcagggaggc 
ttgggtatta 
ggcattgggc 
taaggaggcg 
gagatgaagt 
tatccagagc 
tctgcctcca 
ctcttcagcc 
ccaatggtgt 
ggaaatgacc 
tattaattaa 
cagcacaatc 
ctttgttgag 
gacatacctc 
tctgggtgtc 
tttctggctg 
atttctgcgt 
taatcaatac 
tatcctcatg 
agaccgtctg 
tacctaagtc 
gtcaagagga 
taagcagtcn 
nnnnnnnnnn 
atttttggta 
aggtgatcca 
caggccccac 
ggtgtggaca 
tactcatatt 
caaaccctgc 
tgattctggg 
gcatcggcat 
atagtagcat 
tgctcaatat 
cattttggga 
catcatggtg 
gcgcctgtaa 
cagatgttgc 
ctctgtctca 
gtaggtggat 
cttcagcatc 
ttttgaacat 
agaggatatg 



agatggggtt 
ctgccttggc 
atgctcttaa 
catgagttct 
gtaattcttt 
tctgtgtctc 
aagaggaaat 
cattgagagc 
tgaacactgg 
ttccatggtg 
tggggtggga 
gatactcaag 
actgcggact 
ctctggagcc 
acctctgtga 
tgtacagagg 
gctgcacggg 
ggaagctgaa 
tcttatagat 
ccactctcaa 
aacacagact 
agcaggcctc 
ataatgacaa 
taagagctat 
gaggtactat 
aacttggcca 
ccatgttttc 
ggaatcctgt 
tgatagtata 
tccctcccct 
cacactaaac 
cactgagaaa 
tctttcagga 
tacaaagtca 
tataaagcct 
ctctcctttg 
catttatcat 
atttaataga 
atactagttg 
cttccttata 
ggtttgaatc 
agtttcccct 
ttcaatgagg 
nnnnnnnnnn 
nnnnnnnnnn 
gagacagggt 
cccgcctcgg 
acacctttta 
tgtgggtgaa 
cccacactcc 
ggcaaaccat 
cagctgttgt 
atgtgtataa 
attattaaat 
aagaacatat 
ggctgagatg 
aaacccccat 
tcccagctac 
agtaagccga 
ataaaaaaaa 
gttctaaaat 
atatgaaact 
ttctagaata 
cattttacat 



tcgccatgtt 
cccacaaagt 
tctgtgccct 
ggaatcgggt 
cagacagcac 
cctgccctgc 
ggatccagga 
tttgcccaac 
acttttgttg 
ctggaggatg 
ggggaacctc 
gcatggcatt 
gtagacagag 
ccctgggtgg 
agcgagtgct 
ctgacattaa 
agagggcaga 
gtggcctgag 
tgtacatgca 
ggaaagagcc 
caatccttat 
aaatgtgtgc 
cagtaatgac 
atgtaatata 
cattatgcca 
agtcactcag 
accattatgc 
cttgatgttc 
tcttttcata 
acccctccct 
aaactcataa 
tgaaaccacc 
aggacaggct 
gagggaaggg 
ccatcctgcc 
gacagtgctg 
gattgtggaa 
actgaaaggc 
tgttgccctt 
atggagggac 
ctggctctgt 
tctctgaatt 
gaatacacat 
nnnnnnnnnn 
nnnnnnnnnn 
tttgccatgt 
cctcccaaag 
aacaaccaga 
ggtggcactg 
tcaaattagc 
tccccagccc 
tacccgtgtc 
ttattatatc 
tgttctgtac 
agacctggcc 
ggtggatcac 
ttctactaaa 
ttgggaggct 
gatcacgcca 
aaaaggaatg 
ttgtgtaacc 
tacaaacaag 
attgcaggat 
ttaatagata 



ggccagactg 
gctaggatta 
acccagcctt 
tgatgggggt 
ccttgattaa 
tcacacagac 
gaaggagacg 
accccaaaga 
tggaaaaagg 
gggaagtggg 
cacttgccag 
agaccagaag 
cactggacaa 
gctgccccgg 
gaacctctct 
gcagggatct 
aaaggcctat 
agtgactcag 
gctctcctcg 
ctatctgcaa 
tccttttaag 
ctttctagaa 
agtcatttac 
tattattatt 
actttaaaga 
ccagtaaatg 
tgaagtacct 
ccttccccat 
ccattctttg 
gggagcttag 
gagactgatt 
cagcagatgg 
tttgggaaag 
agttgatgga 
aaggatcaga 
gatttttctg 
ggctttttgt 
aatttcccat 
tgcagagaat 
agagatggta 
tacttataag 
ggggatataa 
aaagtgctca 
nnnnnnnnnn 
nnnnnnnnnn 
tggccaagct 
tgctgggatt 
tttcattcat 
ggagaagtta 
ctgagtctcg 
tgccctctca 
ctccatgttc 
tatatttcat 
attattttat 
aggcacagtg 
ttgaggtcag 
aatacaaaaa 
gaggcaggag 
ctgcactcca 
tatagacctt 
aatctcctat 
gttgcattga 
aaactcctaa 
tttgtcaaat 



108480 
108540 
108600 
108660 
108720 
108780 
108840 
108900 
108960 
109020 
109080 
109140 
109200 
109260 
109320 
109380 
109440 
109500 
109560 
109620 
109680 
109740 
109800 
109860 
109920 
109980 
110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
110640 
110700 
110760 
110820 
110880 
110940 
111000 
111060 
111120 
111180 
111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
111780 
111840 
111900 
111960 
112020 
112080 
112140 
112200 
112260 
tgtcttccaa
tttccccaca 
gaagctgttg 
agagaggtta 
cgatagcaga 
gaatggagga 
ccaccctgaa 
agtatttgga 
gaacattata 
atataatgta 
tttggcaggc 
catagcgaga 
atgcatgctt 
gagtttgagg 
aggaagacca 
aaaagcagag 
ctgttactta 
aacatataaa 
gagataatgt 
ggaaattcat 
ctttgatctg 
aggtataggg 
accaggcagg 
cccaaagggg 
tttatcgcgc 
gcttttacaa 
tttaaaatta 
tagcgataca 
tcagctcaaa 
gtaaaattag 
gataatgttt 
gaggatgatc 
caggctggct 
actgcacact 
agggtatatg 
acaagatttt 
ctaggaggac 
gtctcaaaaa 
acgtgctcac 
tcatcacagt 
tttgttttta 
tcttcaagta 
catattctcc 
tgcaaattct 
tttctgcata 
aactgcagtt 
gcctggagct 
ttagtcagtc 
tccagagttt 
agtgctgggt 
cacacggctt 
tgtttactct 
ctgttgaagt 
ttaattgctt 
aactacaccc 
gtcactggcc 
tcccccacta 
ccatttgccc 
agtggcttca 
cagaaatgag 
ggtgcctata 
cagagtctct 
tcacggccac 
ttacccactc 



agtggtcgta 
ccctggagag 
gccgttatat 
gttagtgggt 
ggagggtgac 
cttaaaatat 



tgtgccagat 
tgggagaaat 
cattctatat 
tcagtaaaga 
cagggtggga 
ctctgtctcc 
atagtcccag 
ctgcagtgag 
tgtttctaaa 
tggtaagaga 
cccatcgtgt 
atagaagtaa 
atataaagct 
tgttgtagtg 
gatttggctc 
tggtctgagc 
tgctgataag 
ggtgtacaaa 
actgaaaaac 
aaaatgctaa 
ctaagtcata 
ttaatgactg 
ttctgcaatt 
gcaggagggc 
gtttttatct 
tggctgtgac 
gggcaggtgt 
tggtcaaaga 
agcacctgtt 
attgtgcatg 
tgcttgagcc 
aaagaaagaa 
ttctctcttt 
gagcacaggc 
cagaactcat 
aaaggaaaac 
agactttgtc 
tttatcctga 
ttctttataa 
ttcttaacca 
tttttttgtc 
aaaaagtaat 
tgcacactgc 
gatggtgttt 
tcatttctat 
ctgcataccc 
cttgaggctt 
ccagccagat 
tggacagatt 
tactgtgttt 
ccagtagcag 
cacctcctac 
acccacagaa 
aaaagcgttg 
aggaaatggg 
tgccccatga 
tctaggcagg 
tgctgaacac 



ccaattaaca
atgaaaaatt 
aaccctcgtc 
gcaaacatac 
tatagttaac 
tccaacacat 
cttgtttgtt 
gataaatgct 
atgtaacaaa 
gagggctggg 
ggatagcttg 
acaaaaaata 
ctacttggga 
cctacgactg 
agaaataaat 
aaggactttg 
gataagcaaa 
aaatcatacc 
tctggaataa 
ggcagccttc 
aggcaagacc 
ttccccagca 
accggaatgg 
acgatctcgt 
aaaaccaaca 
agtaggcatt 
gtacataatg 
aagttcttta 
acacaggctg 
taacctcgct 
cacagagttg 
acctgtcacc 
cccctttctc 
ggaagacctt 
ctgtcctgcc 
gtagcatgag 
caggagtttg 
agaaagaaaa 
ggaatgtagt 
taaagcatct 
tatgttgtga 
cacccatcat 
aacatgaata 
cttttataaa 
ttatcatttt 
tttcactgtc 
tattgcatta 
aatattttca 
tcccagagat 
ataaacttct 
ctctttgatc 
tcttgggtga 
gttttataaa 
ttggccactg 
aaattaattg 
gatcctatag 
gaggctccat 
aataccccac 
tttaagggga 
ttcacatggg 
ggagggttgg 
tagagggttc 
gcttcccaat 
agatgttacc 



ccccgacctg 
tatgggccca 
attaataagc 
aattagatag 
aacaatgtat 
agaaataata 
cttggaagct 
tgaggtgata 
atatcacacg 
cacggtggct 
aggccaggag 
aaaataaaaa 
ggctgatgca 
caccactgca 
taaataaaat 
gggctcaaca 
tgccttaacc 
cacttcaagg 
tgcctggcac 
tgaatctgtg 
tggggaaggg 
gagtgaggct 
gaggctggag 
atgactcctt 
tatttaatga 
cacatgttta 
tgtgagccac 
aacatcaaca 
gggttgaaac 
gaatctcaga 
ttgtgaagat 
ccactgatct 
cctcaccact 
tcctgataga 
agaatctccg 
cctgtaatcc 
agaccagcct 
gaataataat 
aagtgtacct 
tgactttatt 
aaataatttt 
tctcccaacc 
cttactttac 
caaatatgat 
gtggctgcat 
tggggaaatg 
tattcttaag 
aggctcttgt 
gtaggaacac 
gctaatttaa 
attaacaggt 
agtagtcatc 
tgtgagcgag 
aggctcctga 
gagaaaatgg 
tgggggcctg 
gtgtccccca 
atacatgtac 
aaggaattgt 
atcacctaat 
agagttgtgc 
tcaaattaag 
gcttccccaa 
catagcacct 



taatgaatga 
ctttggagtg 
ctgggggtgg 
aagtaataag 
tgtatatttc 
aatgcttgtc 
aagcagggtt 
gatatcctaa 
tatcccataa 
cacatctgta 
ttcaagatca 
cgaattagcc 
ggaggattgc 
ctctccagcc 
aaataaaaat 
gtactagcct 
cctgtgtgcc 
gtcattataa 
acagtaggag 
tcctctttgt 
cagagactga 
gggaaaggtc 
cataaggcag 
tatactgtta 
atgattccaa 
aaaattgagt 
agctatcccc 
tacaatgcca 
ccagcttttt 
tgtctagtct 
tcaataaaat 
ccagagttga 
ccgcatgcat 
ggaggaccat 
aaggagctct 
cagctactca 
gcgcaacata 
agtaataaat 
aataaatgtg 
ctataagcaa 
ccaacattaa 
ttcaataatt 
atggtcgcaa
gttataaacc
aatattgcat 
gaggataatg 
atcaaatccc 
tatattttac 
agacgtcatc 
taagtatgaa 
tgaactattt 
cacttccttc 
cacttcagag 
gcaggggaat 
gctgagaggc 
aactggggca 
tattagagct 
tcactctccc 
tctgtcctgt 
gaagggatgc 
aaaatgcaac 
agagtctaca 
ccccacctcc 
tgcaccatga 



gagtgccttt 
catggtggag 
ggggggagaa 
ttctaatgtt 
aaaatagcta 
tgcggccatg 
gaacctggtt 
ataccctgtc 
atatgtacaa 
atcccagcaa 
gcctgggcaa 
aggcgtggtg 
ttgagcccag 
taggcaacag 
aaaaagactg 
tgaaccctgg 
tcactttctt 
aaagccaata 
tttaataact 
ccactaatgg 
gggcaactgg 
tgggagacag 
ttcagttttt 
atgttttcat 
ggggattctt 
tgatttaaat 
aaaatcatga 
attccagaat 
tgctaactgt 
gtaaattgaa 
cacaacatgt 
ttcggctgat 
tcctcccgaa 
tcttcagtca 
cagtaaaatc 
ggaggccgag 
gtgagaccct 
cacctgtgca 
atcattgtaa 
taaaagagga 
caaagaacat 
ttcaattttg 
tcagtgttca 
ggtctccatg 
tgactatgtt 
ccagggtcat 
agcagtgaga 
tagattgttt 
caaccttgcc 
atgctatcct 
tccaagtatt 
acctgtttat 
tcaatagaca 
gcatgatcaa 
agagatatgt 
acggcctgag 
tgcggcactt 
ttgcaaatct 
tcacttactg 
catccccaac 
agggaatcat 
gcaactaatc 
atcctagact 
ttgtttgatt 



112320 

112380 

112440 

112500 

112560 

112620 

112680 

112740 

112800 

112860 

112920 

112980 

113040 

113100 

113160 

113220 

113280 

113340 

113400 

113460 

113520 

113580 

113640 

113700 

113760 

113820 

113880 

113940 

114000 

114060 

114120 

114180 

114240 

114300 

114360 

114420 

114480 

114540 

114600 

114660 

114720 

114780 

114840 

114900 

114960 

115020 

115080 

115140 

115200 

115260 

115320 

115380 

115440 

115500 

115560 

115620 

115680 

115740 

115800 

115860 

115920 

115980 

116040 

116100 
agcacctccc

ctgcagggct 

tcagcaagtt 

aaacaatggg 

ggttaagact 

agcctggctt 

ccagctgccc 

acagcctggg 

cctagcctcc 

ctaggttcaa 

gccaggcccg 

cacttgagga 

aaaatacaaa 

tgaggcagga 

attgcactcc 

gcagcttccc 

atgcacatga 

ccccttggcc 

agcccacaaa 

ctaagtgtgg 

cttcttgccc 

acagggctca 

ctgtttctcc 

atcagccagc 

tataaataat 

aggaggatgg 

aaaaatacaa 

gctgaggcag 

ccattgcatt 

catacataca 

accctaaaaa 

aatttaggtt 

ccaggcagaa 

gtccctccac 

gcctggtgac 

aatctgattt 

tgggatctag 

ctggatgata 

ccttcaatga 

tcggaggcag 

cacagtgggc 

tagtacacgt 

cactccttat 

tgggaggcca 

tagtgagacc 

agctacttgg 

agccaggatc 

gtctcaaaat 

caatgcctat 

accaaataca 

aaacaactct 

ttatctatgt 

agtcaccagg 

tttattgatt 

tcgacagggc 

ttgctgggat 

actggggttt 

gagccagggc 

acactaaccg 

ggaggttctg 

cattgagagt 

gtgccaataa 

atggggtggg 

tctccagaga 



acagtagact 
gtgccaggtg 
cagagtcaag 
actcaacttt 
ggtgtggagc 
tgcagctttt 
ctaacacctt 
aggcaccacc 
caactgcaag 
atttagaatg 
gtggctcacg 
cagaagttcg 
attagccagg 
gaattgcttg 
agcctgggca 
tccacttccc 
atatccaacc 
accctttggc 
gggccgggaa 
tctagaagag 
ccatggaggg 
gggaaggact 
atctgtaaga 
acacgtaagt 
aggccaggcg 
atcacaaggt 
aaattggccg 
gagaattgct 
ccagcctggg 
tacatacata 
tgagacaggg 
ttcttgcctg 
caaggtgtgg 
ctcccagcac 
cagtgggaga 
tcccccaggg 
ctgagaggct 
cgaagatgta 
tgggacactg 
agaccaggct 
agccctagca 
gggtaggaag 
gagaccacag 
aggcaggcag 
cccacctcta 
gaggctgagg 
atgccactgc 
aaataaataa 
ttctccaggt 
gggtcagcta 
atgagctggg 
atttatctat 
gatggcctca 
gattgattga 
tggtcttgaa 
taccaacacg 
gaaaaggtaa 
acaaacccat 
ccaagtgttc 
tggactcact 
agctgcgctg 
acccctcctt 
cgggggcggg 
tcttggcttc 



gtgtttctga 

cacaaaataa 

tgggataggg 

ctaaccaaga 

ttcattaaag 

aagtcatgta 

gcaggggcag 

ccactagtgc 

catcaatctt 

atccagctcc 

cctgtaatcc 

agaccagcct 

catggtggta 

aacccaggag 

acaagagcga 

aaccacagct 

aacatgtcta 

catgggttca 

gtccacttgg 

agatgcagct 

aaactggcta 

ctttttgacc 

aggggacatt 

tcatgacatg 

tggtggctca 

caggagtttg 

ggtgtggtgg 

tgaacccggg 

tgacagagca 

catacataca 

aaagagaaaa 

ttttagaaag 

aactcactgt 

agtaggcagc 

gctgaggagc 

cccacatcac 

ccatttttgg 

gctttgcagg 

aagcccacag 

agaactcagg 

agtgccaggt 

tcatatctaa 

gtgctgggtg 

attgcttgat 

ccaaaattag 

tgggaggact 

acaccagcct 

atgacagcag 

agtcactagg 

ctctgttatg 

aacaagtatc 

ctattcattt 

aacttgagct 

ttgattgact 

ctcctagtct 

agccaccatg 

gcgacttgcc 

caaagcctgt 

tacacagtga 

cggtgggtat 

ttagtaagtg 

ctacctggtc 

tgtcctcctc 

ccgtggggct 



taggtcagca 
acaaagccaa 
aggctctctt 
gaactccagg 
aagaaaagat 
acctttgatg 
ggctggagtg 
aagccgggca 
gcacttggaa 
ttgaagttct 
cagcactttg 
gaccaacatg 
cacgcctgta 
gcagaggttg 
aactccatct 
ccatctcaga 
aggcccaacc 
tgcactggca 
gctttttgag 
tctgggaggc 
gatgagggcc 
agatctaaga 
aatagactct 
aagcaagggc 
cacttgtaac 
agatcagcct 
cgtgcacctg
aggcagagtt 
agactgcatc
tacatacata 
acatgttctg 
ggcctggaca 
ggaagggttc 
acgtgtctcc 
ccagggtggg 
gtgcccagag 
tagcttctag 
actctctaga 
aggagaggtc 
acttttaatt 
agggtggaac 
gaactgaccc 
cagtggttca 
tccaggagtt 
ccaggcgtgg 
gcttgagcct 
ggatgacaga 
atcatcattt 
ataaaagtaa 
ttctttcatg 
gtccttcttc 
atttatttat 
aggaactagt 
ggttaatttt 
taagcaatcc 
cccagcccct 
caaggtcccc 
gctctcgccc 
aggtgacaaa 
gcccagaggg 
agaactcgga 
tcctttccaa 
cttctcaggg 
gcagatccac 



acatttgctg agcacctact 
agacaacatg gaccctgaac 
cactggaagg taactccaag 
gagctaaaat tctgacttct 
tcacccagac ttgagttcat 
aagttatgtg acctctccac 
caaagggagg cactggtacc 
acctctgccc ccaaggcatc 
aggaacctca cctttgaaat 
atacagaaat acagccagca 
ggaggctgag gtgggtggat 
gtgaaacccc gtttctacta 
atcccagcta cttgggaggc 
aagtgagcca agatcgtgcc 
caaaaaaaaa caaaacacac 



caacaagggg 

acaccctctc 

gaaaggtagt 

attccagggt 

acattccttg 

aaagcagagc 

gcagcactac 

ccccgctaga 

ttaatatata

cccagcactt 

ggtgaaacct 

tagtcccagc 

tgcagtgagc 

tcaaaataaa 

catacaatac 

acaaccttgc 

ggagccctgt 

tgggtgacaa 

attgactggc 

gtctgaagga 

ctgggaaagt 

tttgggagtc 

acatggagtc 

tgtcccagtt 

tggaccagga 

tgtgaaggtc 

ccagacctgg 

cacctgcaat 

cgagaccagc 

tggtgtctgc 

gggaggcgga 

gtgagacaga 

ttctttctgc 

aaataatatt 

ctttgtttct 

ctcccatctt 

tttgagacaa 

gtcaccaccc 

gaggcagggg 

gcccgcctca 

cccatcttct 

tccctagcta 

attgagccac 

gaggtgaagg 

aagggggatc 

agtccagact 

agccagctgt 

aaaatccgac 

ctagagccac 



cctcatgtcc 
caaacatctg 
tcagagaaga 
ccaggataac 
gtcttaggga 
cctctaaagc 
ctctctgagc 
gttactctac 
cccgttgtac 
tgggaggccg 
catctctacc 
tacttgggag 
caagatctca 
taaatacaca 
atggacaggg 
cctttatact 
tcccctcagg 
gtgcagcccc 
tcaggagcag 
atccctagaa 
ggaggcagca 
acagagacac 
caagatattc 
actcagccat 
ttccttttac 
atccgagggg 
ctctgccact 
cccagcactt 
ctgggaaaca 
ctgtagtccc 
ggttgcggtg 
atgacacact 
ctctagactg 
atcagcattt 
tttaagcctc 
atttatttat 
ggtccttcta 
cccaatttct 
tctcgctaag 
gcctcccaaa 
gaataggaaa 
gagagcttca 
cggacctcgt 
gaagagccag 
ttgggtggca 
catccagtct 
tctccagaca 
gctgagccca 
cagagggcgg 



116160 

116220 

116280 

116340 

116400 

116460 

116520 

116580 

116640 

116700 

116760 

116820 

116880 

116940 

117000 

117060 

117120 

117180 

117240 

117300 

117360 

117420 

117480 

117540 

117600 

117660 

117720 

117780 

117840 

117900 

117960 

118020 

118080 

118140 

118200 

118260 

118320 

118380 

118440 

118500 

118560 

118620 

118680 

118740 

118800 

118860 

118920 

118980 

119040 

119100 

119160 

119220 

119280 

119340 

119400 

119460 

119520 

119580 

119640 

119700 

119760 

119820 

119880 

119940 
gccagcactg

ccagtgtcca 

agcccaactc 

agaattactg 

acctcatttt 

cgatgaccgg 

attgtctagg 

ctccccccaa 

gtaagagggg 

atgagaccag 

gctggaaaca 

tgggggtgga 

gcggggatga 

ctttcctcgc 

cttgggggcc 

gacagcacac 

gtgatatttc 

acccagcgac 

acccccaaca 

tcggggtggg 

ctgggtgagg 

acgagagggg 

ccgaaaatgg 

ggaatgaaac 

acgagagaca 

gacagtgaga 

gagagagagt 

cgggggagaa 

agcagagaat 

cactgcattg 

aggggcttct 

cacctgcggc 

gctcccccga 

gagtcaagac 

ctgctgatac 

tccgcccctc 

gctttgggtc 

acatacacaa 

cactaagggg 

gcgcaaaacc 

ttcagcggct 

caaagggtaa 

cacgctcata 

gcacactggg 

ggacagaaat 

gggtgaccca 

cggcgttcct 

gcgaggacga 

gagggggtta 

cagcaaacga 

cctcgctgtc 

gactctcgag 

cagccttccc 

ggcccgcagc 

tcccgggccc 

gagtgggagc 

gggctcaggc 

gggagagaag 

ggtaacctgg 

tctcagcagg 

ggtgggacgc 

tgtccccgct 

actcagcaac 

atgtgggtag 



cggccaaggc 

agagtccagc 

atgttcaggc 

ccaaaatagt 

accaagcact 

gtagcctcac 

gtcacctgga 

ccctaaccgg 

tcagagtgct 

ccagccaccc 

cgacaatgga 

gggggtgagt 

gcagaggggc 

agctcgtccc 

cggggtcccc 

ctgagtcagc 

gctccccggg 

agattgtgcc 

ctccctcccc 

tcctggggaa 

gaagacggga 

cgggaaagga 

aaccagcagg 

tggggaagag 

gagacgcagg 

aagacagaag 

gagggaggga 

aaccagagaa 

gaatgggaag 

gaccctggca 

gtgctcgcgg 

tccagcagcc 

gggccccgag 

ctgggtcaac 

cgcctgtgac 

tttaaaaggg 

ctctcctgca 

aaaaacgcac 

catacacaca 

tcgcacagcc 

cacccgtgtg 

cgggcaagca 

tgcactcgat 

cttgaggtct 

gctggaggct 

gaagctcatc 

ctaaactacc 

tcgaacacag 

gaaagccgct 

gggaggagcc 

cccactctcc 

cggacagcgc 

cgccccgtcg 

ccgggcggcg 

tcccggagcc 

gccctccccc 

gcctgctgca 

cagggcgtcc 

ggaaggggaa 

agggcacttg 

agggggcagg 

ccacctgctc 

tcctcctgcc 

ggggctgtct 



ttgaagaccc 

cagaggccga 

atgatgtgtc 

aacgacatta 

ttctcatgcc 

cgtacagatg 

gaactctgat 

agctaggtgg 

ctacagagaa 

acccaccagc 

tgacacgagt 

cagcagataa 

tgcgactgcc 

aggaggagga 

tcccacccct 

ccgccgccca 

agccagcccc 

gcggctcatt 

gccgccgcct 

acgcagggtc 

gccccagaga 

aatttagggg 

gggcacccga 

ggggagagaa 

gagccccgcg 

accgggcagg 

gggaacagag 

atcgaatgag 

aataagacca 

cggataggaa 

gagcggcagc 

ccaaccccgc 

cctatcctac

cgccctgcag 

caggccatga 

ggtagaggaa 

cccccgcggg 

acactcgcac 

ccgggcacat 

tcacgtttcc 

cacactcacg 

tcctgagtca 

cttgcacgca 

gggagtggaa 

gggacactgg 

ttctcctgga 

gcattccccc 

tcctccgggt 

cccgcctcct 

agcgagtgcg 

ctcggctagc 

agctagcggg 

cgccccgccc 

cagggtagag 

ccgcggggtc 

cgctgccccc 

aggtaagaac 

cctctttcag 

aagctcagcg 

gctgggagcc 

tggacccggc 

cgggggacgc 

cggtgcctca 

ggggaggtag 



agcacaccaa 

gtcctcgatc 

tgattctact 

gctacctacc 

tggggtgcct 

aggaaactga 

ctccagactc 

ggtggggaca 

gaccaaatgc 

cagccaccta 

tccctgccct 

ttatgggaaa 

tggagccagg 

gctccccccc 

ccccgaagag 

cccgcccctc 

actgcgctcc 

ccggggaagg 

ccaggccctc 

ctgtcctgcc 

cttttctttc 

cacagagagg 

gagccgaggt 

agggaggcag 

gggaggagga 

gaaacagacg 

agacagagac

aacgcgagaa 

acatttatca

aggaggagcc 

ccagggggct 

cagcgctgcc 

gccggggcgg 

cctttgtagg 

agggccagag 

gttcacgcga 

agataaggtc 

gcgcgcccat 

atttctttcc 

caccagctca 

tgcccccccc 

cacctgcaca 

caaactcttg 

ggaaaagtgg 

cgcgagggac 

aagttgggag 

aagaagggat 

cgcttaagcg 

agtggtcgag 

ggaaggagtg 

agcctgggca 

gcgcgggcgc 

cgtcccgtcg 

cgccgcggcc 

cccgccgtgc 

tcccccgagc 

gccagcggcg 

ggattgaggg 

ctggggccgc 

cgcgggcgcg 

ccggagcggg 

tgaggactcg 

gcactttctg 

gaggcgcaga 



agcccggcca 

tcaaaatgtc 

gggacaatca

acccctccac 

ggagacttaa 

ggcacaagga 

aaatttccaa 

gcaaatgtgg 

attgtggcac 

aaagtcttca 

caaaaagctg 

ccgtgacacc 

gattcccgga 

agcttcgggg 

cgcgggcccc 

agcgtctgtc 

ggaggcagct 

acgccaaacc 

cccccaggcg 

tcctggaaat 

tgtttctacc 

agctgggggg 

gcccacgggc 

agacaccgag 

gagagacgaa 

agtagagaca 

cacgaaatat 

gaacgagaga 

agagccgact 

gcggcgcggg 

cagcagcccc 

tggccaccgt 

ctccgcggac 

gaagtgccta 

gggctccagt 

agccaacagt 

tcccctcccg 

ctcgcacccg 

acccatcccc 

gacatgcacg 

cccgcttccc 

agcatccttg 

catatactat 

aatcttggag 

gcggctgggg 

gggggaacag 

ttctctagaa 

ggggggaggg 

aaagggttaa 

ggggtggttg 

cacggacaga 

tgggcgtcga 

gggccgatgg 

cggccacgca 

atccggcggg 

atcgagacaa 

ggagagcgga 

tggggcagtt 

gcccccgccg 

tgcgaggagc 

gagggaggct 

ggccggctgg 

gccacctggg 

gggaaatcca 



agcctccagc 

taactgcaga 

ttgccaccaa 

acaccaacac 

tgcagcctcg 

aggggagtac 

ttcgtccccc 

atggggggag 

ctactgtaaa 

gtgggcacct 

gtagtctagt 

tgtataaggg 

cggggcttcc 

ttccgcctgc 

gggaaccgat 

tccgcatctt 

cggcaaacaa 

ccaccctgct 

caggccctag 

agggggagcc 

tgatccgaaa 

ccgagaaggt 

cgggagcctg 

acacacagag 

gacacagaga 

gaaaaggtcc 

gagtaagagt 

ccgtggaggg 

gtatgccagg 

cagcggggcg 

ggcaccgccg 

acccgaagcg 

gcgccgggcc 

ggtgatgggt 

gagaccataa 

cttctcccca 

gacacatcat 

cttgtaaatg 

aagatcgcaa 

ctggcggact 

caagcccgta 

cgcgcacgtg 

tcttatagtc 

ctgtcccagg 

gcgggggagg 

gacaagtcca 

gagtggcgcc 

gggcggggtg 

gtcggcaagc 

ggaagagctt 

cggactgacg 

cggccagccc 

ctcctcccga 

gcccggggac 

ctcagggagc 

gatgctgccc 

gggcatcctg 

ggggaggtgg 

ccagggctgt 

tcgtgaccga 

caggttccgc 

ggaagcgccg 

aagacaggag 

agtggccctc 



120000 
120060 
120120 
120180
120240 
120300 
120360 
120420 
120480 
120540 
120600 
120660 
120720 
120780 
120840 
120900 
120960 
121020 
121080 
121140 
121200 
121260 
121320 
121380 
121440 
121500 
121560 
121620 
121680 
121740 
121800 
121860 
121920 
121980 
122040 
122100 
122160 
122220 
122280 
122340 
122400 
122460 
122520 
122580 
122640 
122700 
122760 
122820 
122880 
122940 
123000 
123060 
123120 
123180 
123240 
123300 
123360 
123420 
123480 
123540 
123600 
123660 
123720 
123780 
tctggtagga

aagggtgtgg 

aggagaactc 

gggggagggg 

ggtgaagacc 

agactatgga 

gggaccctgg 

gggatggtga 

atggatggga 

agggactggg 

tccacgtggg 

gtggctggtg 

gacagagggt 

gctcctgtgc 

accacggtgg 

cagctctaaa 

accaccacct 

tgcccgcagg 

gtctgggacc 

atcctttcct 

ctgcccccta 

cggcccaaag 

ttgatagcca 

cttcagcctt 

ttctggccat 

tctagaacaa 

ttccttgcct 

gcacctgggt 

cgtggtttct 

tgggcttgat 

ggccactgtg 

ttgacctcca 

tgcagtgcct 

ctgcagctct 

aggctgctac 

ctgagaaaga 

aactcacact 

taggtggcca 

ctccaccggg 

ggaggcctgg 

gggaagggac 

ggggagagag 

ccctggagtg 

cctcccctcc 

gttacagctg 

gcagcagctg 

ccactatcat 

aggagagtgg 

ctagggagag 

cggaaggaac 

ctgtctctcc 

gggctagcgg 

ggatcagaga 

ggtgactctg 

tccctgccaa 

tcatcacctc 

gaaatctcca 

tgtgcttcca 

accccacgtg 

aggggtgaga 

tgtcactgag 

acgcggagga 

gacactcacg 

ccgcacctgt 



gcgctagaaa 
gggggtgggg 
aaagtggtga 
aaaggttaga 
gaaggagagg 
gaatcctggg 
ggaaaaggag 
ggtatcagga 
agaggaggga 
gctgcccagg 
ttcccaaagc 
ggggtgcttt 
gaaggggagt 
ttatccggcc 
tggggtgtta 
ccgcgtcagt 
cctcagctcc 
tccctgaagc 
ccttccagag 
tgcaccagcg 
agttttcttc 
gggctgagaa 
cagtggtccc 
taccatgttc 
caggggtcat 
aagcctagga 
tagccaaaca 
tcctgtcttt 
tggccacctc 
tccaatctgg 
ggggctccct 
tgtgttgtgt 
aggtgcttct 
tgctgacttt 
cctctgtcca 
gagctgtaga 
agccctcaat 
cccctttcca 
atgaaggcca 
aaggagtggt 
agaaaagggc 
agctctctag 
ccccagcagg 
cagtgagagc 
cagtggcaag 
gcccccagca 
aggtgagccc 
gagaagtagt 
aggcaccctt 
ggggaatggg 
gtcagtccgg 
tcacactctg 
ttcttttggt 
aagtgccctg 
tgtgaccttg 
atgcttgaac 
ccaaacctca 
ctgcctggtt 
actcgggagg 
aacgcggagg 
cacccagcct 
cagccagcac gcaccgacgc 
gtggggccca aaagcgagga 
gtgctccccc ctctggagga 



gagatggagg 
gccagagact 
aaagatggag 
gagaaggaat 
aaggaagagg 
cagggaccca 
ggactggaat 
ctagagaagg 
gaggggctgg 
gattgggggt 
aagccctcca 
ggccagatgg 
gaggactttg 
tgccatttat 
ctgcttattt 
cctccgtcac 
ccatcccact 
ccccagccct 
ctgtgcccct 
gggaccactc 
ctttgggtcc 
agtggcctgt 
gagtcctgga 
ccgggatcaa 
atattctgga 
ccgcaactgg 
tgaccctgaa 
gcaatttctc 
gcggaggatg 
gtccctcttt 
tcatgtttct 
ggggattatg 
aggacaaaat 
agccatcccc 
aaagggcttt 
caatgaaggc 
tcccctcccc 
ccgaggtatc 
aattcacaaa 
gcctggaggg 
gtggcaagaa 
ggaaaggcac 
ctcgctccag 
tccttgcctc 
ctttgcacgg 
ttatggagga 
gggctggggg 
ggaaagagaa 
aaagaggagt 
cctgatgcca 
tggggcagcg 
ccatcagggc 
aggatttggc 
acaggaaggg 
cactttgctg 
agtagaactg 
gttctaagcc 
cactccacct 
ggtctaagaa 
agggccgaga 
tattacatca 



gaggatagtt 

tggggagggg 

gggaagctca 

tgggaaaagc 

aaagaccaga 

atggaggtag 

aatggaaaga 

accggcgaag 

agaggaggag 

gatgagacaa 

gcctgcctgc 

gagctcaggg 

gacagcctcc 

cggacgttgg 

cattctggca 

gcctccccag 

cacctcctca 

agcccgttgg 

cgagaggcct 

actctgccct 

tcctcaagtt 

ctttgtttct 

tcagtgaaca 

tggcctctct 

ctcttcttga 

ggtgaaagtt 

gactcagcat 

acctcaaggg 

ttaaagaaag 

gttccacctt 

tccttcatct 

tcaatgtata 

tggtttcctc 

agaggagggg 

tgaagtagtg 

gacctggtca 

ggtgttacct 

gctctgatac 

ggagtggagc 

ggtgctgagg 

acacactggg 

tcccccaacc 

cctggggcag 

cgctgcgtga 

ggccagaaat 

ccgggtcgct 

tggttggggg 

ctaatgtggg 

gccagctcct 

ccttgcctct 

aggctggggg 

gtgagcttgt 

gtcaagtcct 

gctgggcatc 

ggtaagtcgc 

cagaggaatc 

accctcatga 

cagatgaggg 

agaaagtaat 

tggtgtgaac 

acacacgcac 

agcaccgacg 

gcagcacact 

ggaacaccag 



ctactgattg 

tcagggggag 

aaggaggagg 

gtggagggaa 

tcaggggagg 

cgggaaagag 

tcagaaacca 

agggttggag 

gaagagaaag 

agaggcttct 

cacatttctt 

ccctgggggt 

gagggtgggc 

attctgcagc 

gagtgagaag 

gcccctgcgt 

gcccctgccc 

gtgtggagcc 

ctgctgcctt 

gtgggtgggt 

ccttcttcta 

gagccttggt 

gatacttttg 

tggctccctc 

ctcccacatg 

gcagagagag 

tgtggctggc 

cagtgtctca 

atcagagtgt 

gtactagctg 

gcaaattggg 

aagaagttaa 

ctaccctgct 

gtgagcagag 

gagggatgaa 

ggaggcctgg 

atgatgtgag 

tctgtgcatc 

cgtggtcctc 

aggagtgaga 

cagggcaggg 

ccccagtccc 

tgaagcccag 

attatggatg 

ggcaacagag 

cttcagagag 

agacatggga 

caggtgggga 

gcagagttta 

gagattgcac 

aaagctctgt 

ggataagggg 

tagggaagtg 

aagagacttt 

ttgctctctc 

gaggggacct 

aacccactca 

gtaagagaca 

cgtgaaacgc 

gaatggaaca 

ggggcccggc 

cagcgccagg 

gggagtgtgg 

ggcagctggg 



agtgacagat 

agggatagga 

gagatggagc 

gtgggaccca 

gatgggaaga 

aatcaggact 

gagaaggatg 

acagggaacc 

gctgagagag 

ggtaaccact 

ctctcaggga 

ggctgtgagg 

aaacaggctg 

cgctgccgcc 

ctgtttgcag 

cactggcatc 

cctcagcatc 

cttgcttctc 

tccagggagc 

agcctggatc 

caggggcctc 

actccaaggt 

gctctggaca 

ccctggtcag 

taatcactac 

ctggagtccc 

ccagccctag 

cacattcagg 

ctctctgaca 

catgacctga 

ggccacaata 

cctgtacaaa 

gtactctccc 

agggtggggg 

atgaaggctt 

ggtgctcaga 

gggtcggctc 

ttgccccagt 

gggagagaca 

acagggggtg 

actgagggcg 

accacctctg 

agccccctcc 

agctccttgg 

tcactgttat 

cctgcaggga 

acaagatgga 

gcaggagagt 

ccctcaaggc 

atccttccct 

aatcctccag 

taggattaag 

gagatcagag 

tctggccctt 

tgagctccag 

ttgcggcttt 

gggtccccac 

ttgctcctcc 

cgcacggggg 

gcagccgctg 

gctcacacac 

aggggccggg 

atcttccacc 

atgccagcgc 



123840 

123900 

123960 

124020 

124080 

124140 

124200 

124260 

124320 

124380 

124440 

124500 

124560 

124620 

124680 

124740 

124800 

124860 

124920 

124980 

125040 

125100 

125160 

125220 

125280 

125340 

125400 

125460 

125520 

125580 

125640 

125700 

125760 

125820 

125880 

125940 

126000 

126060 

126120 

126180 

126240 

126300 

126360 

126420 

126480 

126540 

126600 

126660 

126720 

126780 

126840 

126900 

126960 

127020 

127080 

127140 

127200 

127260 

127320 

127380 

127440 

127500 

127560 

127620 
cacactcggg
gagcacctgg 
gcaacctcag 
gccttggaga 
cccaaggagc 
agcctaggac 
cctggctttg 
aggagtcacc 
tatcagctat 
tatgagtaca 
catcagtatc 
aaaaaaaaaa 
gccaaggtgg 
gaccccatgc 
tgtcacccag 
gttcacacca 
atgcccggct 
aggatggtct 
attacaggcg 
ggcacgggcc 
agtgatccaa 
caagaccctg 
agcctggggt 
aggactggct 
ggctctgact 
tggcaagacc 
ccaagttact 
atgtctggag 
tagaagacag 
ccggcccaac 
tctagacaag 
ccaatttgca 
tcagaaagtg 
catgggagca 
ggtcttgttg 
aaactcctgg 
acatgccacc 
tgttgcccag 
aagtgctggg 
ttggggggct 
tcccctcccc 
ggcggccccc 
ccaggccaac 
cctgcagaac 
cctgtggctc 
agccctggag 
cttccagggc 
gcccggcaac 
cctgctccac 
ctctctgtgg 
acccttcctg 
ctccaggggc 
tctctctctc 
gtctatccct 
tgcctcttct 
cgtgtgtgta 
cccccagggt 
gcccacatca 
aagttgcgct 
gattcccatt 
tgttatgcag
attgttatta 
aaatcatccc 
aaaatcttcc 



gcctgtcagt 

ctcacgccac 

gagcgtgaca 

ccccacactc 

cccaggatgt 

gttgcttgaa 

ggggtgcagg 

tggaaatcat 

agtgacggca 

gacacctgca 

ataataagga 

gctttggctg 

gaggattgtt 

ctacaatttt 

gctggagtgc 

ttatcgtgtc 

aaattttttt 

cgatctcctg 

tgagccaccg 

tgtggtccca 

gctgcaataa 

tctccaaaag 

gtaaaatgca 

ggtggcctca 

ctgcaagggc 

agaccccaga 

tccagtcatt 

acattttggt 

agatgctgct 

tgtcaatact 

gtcaaacatg 

gtgctttcta 

gaagtggcca 

ccagggctct 

tgttgcccag 

gttcaagcga 

atgcctggct 

gatagtctca 

attacaggtg 

ggcccccagc 

acagctcccg 

agctgcccca 

aacttctcct 

aacctcatcc 

ttctccaaca 

gagctggacc 

ctggagcggc 

atcttccgag 

ctacaggtga 

gcccctctgc 

cctcagcatc 

tttacttttt 

tctccctcta 

ttccatccat 

gtctgtctcc 

tctgtctctt 

cccctgccca 

cggtgaagta 

tctctccaat 

ttccaaacct 

ctggagaatg 

ttattgatct 

actccatttc 

cttggctaca 



cccatgcgtg 

tacccaaaat 

caacacacac 

aaaatcacca 

cagagtgcag 

gcagaaggtg 

ctcccttttt 

aagaaagtgt 

aaggccaggg 

ctccactctc 

tggatcatat 

gatgcggtag 

tgaggccagg 

tttttttttt 

agtggtgcga 

tcagcctccc 

tttgtatttt 

acctcgtgat 

cgcccggcca 

gctcctcagg 

gctgtgatcg 

aaaaaaagaa 

gattcccagg 

cttagagacc 

gaaaagtaca 

gtgagtccat 

ctccaatggg 

tgtcacaact 

aaatgcctta 

gaggcagaga 

caatagtgaa 

caatggcctt 

ggccacttga 

aggcctttat 

gctggagtgc 

tcctcctgcc 

aatttttttt 

aactcctggc 

tgagccacca 

tggcactcct 

cctcggcctg 

tgctctgcac 

ctgtgccgct 

gcacgctgcg 

acctctccac 

tcggtgacaa 

tgcagtcgct 

gcctggtcag 

gcctgccctg 

tccccgaccc 

tccatttctc 

cccttctgcc 

actccacaac 

cactgcctct 

cttcacacac 

tctgtgatct 

aaggcctttg 

gagagagaag 

tcactgggca 

gtcatctcaa 

gatgctctga 

aattattgtt 

cccaggaagc 

gagcctccgg 



cacacctggc 

cacagataca 

aaaaccacca 

acccctcagt 

aaacaagtct 

ttcagtcact 

ccccaggcaa 

agaggtcaag 

atgatgggag 

tagcccccag 

ccaaccttca 

cttatgcctg 

agtttgagac 

tttttttttt 

tctcggctca 

aagtagctgg 

tagtagagac 

ccacccgcct 

aaaattttta 

aagctgaggc 

taccactgca 

agaagttttt 

ctgtcccccc 

cgacccttaa 

ggaaagtaag 

ttcacacggg 

gtgactttgc 

ggaggcaggg 

tatagggctg 

aaccctgacg 

aacaggaatg 

ttggcattat 

ggctataacg 

ttttattttc 

aagggtgtga 

tcagcctccc 

tttttcttgt 

ctcaagcaat 

tattcagccg 

gccctggaag 

cctcctgctg 

ctgctactca 

gtccctgcca 

gccaggcacc 

catctacccg 

ccggcacctg 

gcatttgtac 

cctgcagtac 

cccccaccct 

tggcgtgcgt 

tctgtctatg 

tctctacctg 

cttcacctct 

ctcactaact 

ccactccgca 

cacgtgtttg 

cagctgtttt 

gcagagccac 

atgggacggg 

tgcaggggaa 

aaatggaagg 

tattgttgtt 

aataacacac 

ctggaagggg 



tgagcagcac 

tacacacatt 

ctaagcaagt 

ctctcccagg 

tcctcccctc 

gtgtgcccag 

aactgccaga 

ctagttccgg 

gccctgcacc 

gctctctggg 

aaagttactt 

aaatcccaac 

cagactgagc 

tttgatacag 

ctgcaagctc 

gactacaggc 

ggggtttcac 

cagactccca 

aaaaattagc 

aggaggattg 

ctccagcctg 

aagtaactgc 

aggaattctg 

ggcccctccc 

ggcactgggc 

cctcagatct 

cccccagggg 

tgctgctggc 

cccccacaac 

ttagtctttt 

aagagatgat 

tttttaatat 

ttgtcccctg 

tattttttcc 

tcgttgctca 

aagtagttgg 

aaagacaggg 

cctcctgcct 

ggtctaggcc 

cccacctagt 

atgctcctgg 

tccccgccca 

cccagcactc 

tttgggtcca 

ggcactttcc 

cgctcgctgg 

cgctgccagc 

ctctacctcc 

cagccccttt 

ccctcctctc 

tctcttttct 

tttaggtccc 

ctgcctctgc 

tgcctccccc 

tacaccccca 

ccttcagggc 

tctcacccac 

agccactggc 

agaagcccac 

gaaagaaaag 

aataccagta 

atgctgactg 

cctccaaacc 

gtgaaaatat 



tgcatttggt 

cacgcacacg 

gcaatttgca 

gtctctgaac 

tgccttcaaa 

ggaatgactg 

agaaaatccc 

cctagaactt 

cctattaaaa 

cctgcttttc 

tgggggaaaa 

actttgggag 

aacatagcaa 

agtctcgctg 

cgcctcccgg 

gcccgccacc 

cgtgttagcc 

aagtgctggg 

tgggtgcagt 

cttgagccca 

ggcgatggag 

gaatgaggag 

catagttcct 

ggcacaaaga 

accagtgggc 

ccaaagggtc 

acatttggca 

atctagtggg 

gaggaactat 

gacattaatc 

cattcttcaa 

atgagaagcc 

agcccccaga 

cctgaaacag 

ctacagcctc 

gactacaggc 

atctccctta 

tggcctccca 

ttttaccaag 

aagttctgct 

ccctgcccct 

ccgtgagctg 

agcgactctt 

acctgctcac 

gccacttgca 

agcccgacac 

tcagcagcct 

aggagaacag 

ctggtttcct 

tccccaggcc 

ctcttacatt 

ttgctgttcc 

ctgtctgtct 

atctgtcttc 

tgtctgtctg 

actctgcctt 

cctcaagtct 

atcccacaga 

accccttcta 

ggtaaatctc 

attgttattc 

tttgacacgc 

accctgagag 

ccaaattctg 



127680 

127740 

127800 

127860 

127920 

127980 

128040 

128100 

128160 

128220 

128280 

128340 

128400 

128460 

128520 

128580 

128640 

128700 

128760 

128820 

128880 

128940 

129000 

129060 

129120 

129180 

129240 

129300 

129360 

129420 

129480 

129540 

129600 

129660 

129720 

129780 

129840 

129900 

129960 

130020 

130080 

130140 

130200 

130260 

130320 

130380 

130440 

130500 

130560 

130620 

130680 

130740 

130800 

130860 

130920 

130980 

131040 

131100 

131160 

131220 

131280 

131340 

131400 

131460 
ccctctccct
ctagttatca 
caatagccat 
tacagtagtg 
gagggaaaga 
ttgcctaaat 
gccacagtgc 
ttttttttac 
catctcctag 
gtgcttggca 
aacgagcctt 
atgtgggtca 
acagtttatg 
ttttcacctc 
gatatagtca 
gtcttctcac 
tagggacatg 
tcatggtctg 
aagcacacag 
tttggacaaa 
ggatagtact 
tggagtgcag 
ttcctgcctc 
tttttgtatt 
gacctcaggt 
ccatgcccag 
atttagcaca 
actactacta 
atgctttcaa 
gggcctgagc 
ctaaccctct 
ttccaccaag 
gctggtgtgg 
acttgaagtc 
aaatacaaaa 
taaggcagga 
actatactcc 
aaaagcccat 
ctcaggccaa 
caggtggaca 
atggtaccag 
gaaaacacag 
aggtgaggaa 
ccaggcctgc 
gaatggggaa 
gtatctacca 
gtggccctgt 
tgttttcttt 
agtggcacga 
tcagcctccc 
tttttagtag 
ggtgatctgc 
gccgacactt 
ttttgttttg 
cgcaatcttg 
tccccagtag 
agagacgggg 
acccgcctcg 
ctgaatggtt 
ccacacctct 
ggacccatct 
aggagatact 
tagctatcca 
ccaaggtcac 



acttgaacct 

atctgctagt 

aacaacctaa 

aacagaacag 

agtaaaatgc 

agtgacagta 

caagcacttt 

ctcctcagac 

ggattttgaa 

tacggtaagt 

atttaacatt 

acttcaagga 

aagtgcactc 

cttgttaaca 

cctccaatca 

catcatgttg 

gaatgaatgt 

gctacaaggg 

tttggactca 

ttacttaacc 

taataataat 

tagtgcgatc 

agcctcctga 

tttagtagag 

gatccactca 

ccaataataa 

aggattaaaa 

ctactactaa 

aatctcctta 

ttgaatcccc 

ctgaacctca 

aatggctgta 

tggcccatgt 

aggagttcga 

attaggcagg 

gaatcacttg 

agcctgggtg 

agttcagtgc 

caacagcaat 

ttggggtctt 

gcatcatggg 

cctctgcctg 

acatgggctg 

cccaaagggg 

gggccaggca 

catgccaggc 

tctcccagag 

cttttttttt 

tctcggctca 

aagtagctgg 

agacggggtt 

ccaccttggc 

gttttctctt 

tttttgtttt 

gctcactgca 

ctggaactac 

tttcaccatg 

gcctcccaaa 

taaaaatagt 

aatttgctct 

gtgaaatggg 

actgagctca 

ttaacaccag 

ccagctagta 



ggaacgtgct 

tggaaaatca 

caaacatact 

accaaacccc 

acggtatatt 

atagcagtag 

ataggtatcc 

ctcagttttc 

agcattaaat 

gctatataaa 

ggtttcagtg 

ttatactaag 

agccacctca 

atggagaagc 

taaatctttt 

agcctcacaa 

tcctgaggcc 

gacagtactc 

gcaagaccta 

tctctcagtc 

cttttttttt 

tcggctcact 

gtaactaaga 

aagggtttca 

cctcggcctc 

tccttattta 

tgtaaggcat 

tactgagatc 

tcaatatata 

gatctactat 

cttactttat 

aataggaaac 

ctgcaatccc 

gaccagcctg 

cggggtggca 

aacctgggag 

acagagcaag 

tgaagaaatc 

caggacctga 

ctgaaatggg 

aaggaagcaa 

tgaactgcct 

gggtgagatc 

agctttggag 

ctgtgccagg 

aatgttccag 

catattccct 

tttgagacga 

ctgcaacctc 

gattacaggc 

tcaccatgtc 

ctcccaaagt 

tcagtcatta 

tgagacagtc 

gcctcacctc 

agacatgtgc 

ttgcccaggc 

gtgctgggat 

ctttatgctc 

atggctttgt 

gataacctgt 

cagcccaatg 

ggaggacacc 

aggaacatga 



tcctctgcct 
ggtcagtgct 
gagcacccac 
ctgccttcac 
ggaaaaatat 
ccgccaccac 
actctgccat 
tcatctgtac 
gcatgaataa 
tgcttgttaa 
aagtggccca 
gtcatgagtg 
tctcatttct 
tgaggctggg 
caaccattgt 
caacctggtg 
acacacccag 
tggagtacaa 
ggttcaaatc 
tccatttcct 
tgagacgacg 
gcaacctctg 
ttacaggcat 
ccatgttggc 
ccaaagtgct 
agaagttttg 
ttagcacata 
aaatactact 
ttagttattt 
tttctgactt 
ctgcaaactg 
gagttagtgt 
agcacttcgg 
gccaatatgg 
ggtgtctgta 
gtggaggttg 
actctgtctc 
atgttattat 
ggtcagcaaa 
aagtgtttgt 
cttcacacct 
ggtagggctg 
cgcagggtgc 
gaaactccac 
tgagttcatt 
gtgccaggga 
actcaagtgt 
agtctcgctc 
tgcctcccag 
atgtgctacc 
ggccaggctg 
gttggattac 
cagtggcctg 
tcactatgtc 
ctggggtcaa 
caccatgtcc 
tggtctcaaa 
tacaggcatg 
aagcagatca 
gcaagttatt 
accttggcga 
tctggtacaa 
aactgaagct 
cctagaattg 



catccagggc 

gatgatgcta 

tacgagctag 

agagatacca 

gtcttatatt 

ttagtgggta 

ttacaagcgt 

aatggggtag 

tttgtaaa^c 

aatactattt 

acttggactc 

agtcccagaa 

acagcccagt 

ggccctgaag 

cggtgtgacc 

atagggacag 

gaagagctgg 

ttgagcaggc 

ctggctccta 

catctctaaa 

tcccactcta 

cctcccaggc 

gtgtcactac 

caggctggtc 

gggattacag 

taaggattaa 

tgggcactat 

acaaattgat 

aggaggaatt 

atttaacttt 

ggaataatga 

atagaaagcc 

gaggccaagg 

tgaaaccctg 

atcccagcca 

cagtgagctg 

aaaaaaggaa 

gaccccatcc 

ggcttgggca 

tctctacgcc 

ggccttttat 

ggctgggaga 

aggtgtgacc 

cagaggacca 

catcaacaga 

ttcaggagag 

agccagatga 

tcttgctcag 

gttcaagcga 

atgcctggct 

gtcttgaact 

aggtgtgagc 

catggttttt 

acccagctgg 

acaattcccc 

agctaatttt 

ctcctgaact 

agccaccgta 

gatctcagtt 

taaccactct 

gcaggggttg 

agtgagtatc 

cagcaaaata 

gcccaggtct 



tagtgcctaa 

atgataataa 

atgctaagaa 

ttcccatgag 

attcttattg 

cacagggtca 

gtgacatttt 

caagagcacc 

acttagaata 

taaaaaaaga 

catcctgaag 

attgcacctc 

tgggagatta 

accctataga 

ggaggcttat 

ttaggggcac 

cgcttgaacc 

tcatttttga 

tatatatgac 

atggcaatca 

tcgcccaggc 

tcaagtgatt 

acccagctat 

ttgaactcct 

gtgtgagcca 

aatgtaaggc 

aataataatt 

catgcattta 

tggagtcaga 

aagcaggttg 

aaataatacc 

catagttcag 

tgggtggatc 

tctctactaa 

ctagggaggc 

agatcgtgct 

aaaaaaaaaa 

tccattgact 

gaggggacct 

cctggcatga 

agaggagatg 

tgccacaggc 

caagatggag 

cagcttttca 

tatttactga 

aacagaaaca 

taaagacact 

gctggagtgc 

ttctcctgcc 

aatttttgta 

cctgaccaca 

caccgcaccc 

gtttgttttg 

agtgcagtgg 

catcttagcc 

tctattttat 

taagcaatcc 

cacagctggc 

tgaattccag 

gagcctcgat 

tgaggattaa 

caatgaatgg 

aaagcacagt 

gtctgactcc 



131520 

131580 

131640 

131700 

131760 

131820 

131880 

131940 

132000 

132060 

132120 

132180 

132240 

132300 

132360 

132420 

132480 

132540 

132600 

132660 

132720 

132780 

132840 

132900 

132960 

133020 

133080 

133140 

133200 

133260 

133320 

133380 

133440 

133500 

133560 

133620 

133680 

133740 

133800 

133860 

133920 

133980 

134040 

134100 

134160 

134220 

134280 

134340 

134400 

134460 

134520 

134580 

134640 

134700 

134760 

134820 

134880 

134940 

135000 

135060 

135120 

135180 

135240 

135300 
agagtgcagt
ttggacccta 
agataataat 
gtagcacaga 
gccgatgatg 
cacagagtag 
aattcataaa 
tcacagatga 
aagcagctca 
aagaacagga 
cagagagctg 
ggatgacttt 
aggggcgggg 
ttttaacagt 
ggatttggca 
gaagggaggg 
agaaattctg 
ctctggaact 
tgctcactct 
tgtcctactc 
agggattcca 
tgagtcaggc 
gggtggtggc 
ggacggatct 
agggcggaag 
tgggtgtagg 
gtggcaggca 
gggaacaggc 
agacctcctg 
cagtgaggtc 
ccccagtgct 
aagtcagctg 
ccactggccg 
caaggtcctt 
gacagcgcct 
cctctggggg 
ttcccccagg 
tattgtcata 
cttaacagca 
gaatattgta 
acagctaggg 
cttcataatt 
tctttatgtg 
ccccatttta 
taatggctaa 
ctcatggaac 
ttcatttaat 
tgcccacggt 
ccagtctcca 
gccacactgg 
ctcctgaatt 
aatacacccg 
aggttagagg 
cttctgccac 
tgcctcaggc 
gcggacctgg 
gagcacgtgt 
cagggcgtgc 
aacaacagcc 
ctgcggctca 
ttccagcgcg 
ggccgagacc 
acgcggccgg 
gaggccgggg 



tgttcagagg 
ggcgagtcac 
agtatccacc 
gcttggcaca 
acgatgatac 
acactcggta 
aatcagatgc 
aaagacccag 
tttgattagt 
tgctgtcagg 
tgaaccggcc 
agcagaggct 
ggagggctag 
ggagagagga 
acagcttgga 
gcaggacttt 
tgggccaaaa 
ttggtctact 
cccccaaccc 
ttgctgacac 
agcctacccc 
tgaggccagg 
agcagggggc 
caatgtggcc 
ccagatggga 
ctgggagtca 
tggggaggaa 
cccctgggtg 
gaacctgctg 
agggagctga 
ctcctctaaa 
atcatcagaa 
tgggccacac 
tctgcactaa 
ctaaggggag 
tggagctgat 
gagctcctct 
gtagccacca 
tttatcatcc 
catactgagg 
aaaaggcaag 
ggaccaaatc 
aaccatcctc 
cagatgagga 
caaacaaaaa 
tcatagaact 
ctctacatta 
ggcatcagtt 
tacttcacca 
cttccctggg 
agaaactctg 
aaagtttagg 
aggcagaatc 
ggtgcacccc 
ccattctctt 
ccaacctgag 
ttcgcggcct 
accgcgcggc 
tggcctcgct 
acgctaaccc 
cgcgcgtgtc 
tgcgcgcgct 
gcagccgcgc 
cgcccccagc 



tctctggagt 
ttcacttctc 
tcatagggtt 
tggtaagagc 
ccatcctaga 
cagctctgtg 
agatgggatc 
gcccagagag 
gtcagagcca 
gaagcaggca 
agtggggagg 
ctcagcccag 
ggctgtgaaa 
aggaccggat 
tggggggagg 
ttgcagagaa 
cctggggctg 
cttctgacct 
ccggcccctt 
ttcataaaaa 
ctggatgggc 
gaggtgcaag 
ggagaatggt 
aagaggaggg 
gggggcgaga 
atgccctccc 
agcgtctccc 
aactggcctg 
accacagtga 
agtcctgctt 
aaagtgagct 
tcccctggtg 
acacccgccg 
tgtggcctga 
tagtaatgca 
gcccctcacc 
tttaaagaaa 
tttattgatg 
aaccctcctt 
aacctgagac 
ctggattttg 
cctgtgtgct 
tggaatcctc 
aacaggcaca 
caaaaacaaa 
ccacaaggaa 
tacagatgag 
aatgacagat 
aaccagaagt 
gaacctgtag 
ccccccgccc 
aaccactgac 
caaatccagg 
cttccctccc 
cttctgtgcc 
ccacctcttc 
gggcagcctg 
cttccgcggc 
gcccggcgag 
ctgggcgtgc 
cagctccgac 
ccgcgaggcc 
ccgcggcaac 
cgatccctcc 



tggaagccac 

tgaggctcca 

gtgacaatta 

tcaatcagtt 

ctgatgagct 

gaatgaatga 

ttaaagatca 

gtgcttggag 

agagctggga 

gggatgctgt 

caagggaaat 

agggagggga 

gtcaagagct 

ttgaaagcta 

aggcaatgga 

acaaaaggag 

tgggtcaaag 

cccgaggtcc 

atcgatcctc 

gaggaacccc 

ctggaagaga 

gagccagctg 

ggtgtcagag 

ctcttggcac 

acaggcagga 

ccaacctgag 

caggcagtga

agcagagtgg 

tgccctgcac 

ccctctctgg 

gggctgatgg 

agctggttat 

ccccccgctg 

ttaggtgact 

atgtggcttc 

ccaataccca 

agggacagga 

gttgactatg 

tagcctgctg 

tccatgaggt 

aactagggct 

gggcacgtgt 

agaacaaacc 

gagagatgac 

aattaaaaaa 

aggtgttcta 

gaaactgagt 

ccaagatttg 

tctgaaactc 

acatggggat 

caccccgctc 

ctcaccaata 

atgctatgaa 

ccggccaagg 

ccactccacc 

ctccacggga 

gaccggctgc 

ctcagccgcc 

gcgctcgccg 

gactgccgcg 

gtgacctgcg 

gacttccagg 

agctcctcca 

accctctacc 



gttccactgc 
tctcgtaatc 
agttactata 
acctgcttga 
ctgtaagcgg 
ggcacatccc 
cctatcctaa 
ctgcgcaagg 
gtttggaggg 
gttaagattc 
gtggtttttg 
gatagggagg 
tattaatgca 
cattcaagga 
ccccaaggca 
aggagaggag 
gcacctgaat 
cccaaaatgt 
tgaccataca 
atttaggtgt 
acaagagcac 
gaggcctgag 
gcagccgaga 
gctcagttcc 
gcacaggaag 
gcctccgacc 
gggagggaga 
atgctcctgt 
aagaggggag 
caagccctta 
gtgccaaggc 
aatgcagagt 
ttaattctga 
ccctagcacc 
cttcctctcc 
gcctagtagc 
cccaattgtt 
cacctgccag 
agggggttat 
taaaacttgc 
ctaagtgctg 
ccagcacttc 
caggaagtag 
tggcttggcc 
aaaaaaagaa 
agcaccttca 
cacagatatc 
aaatcagaaa 
aaactgtggt 
tcccaggctc 
agagatccgc 
ccactttttc 
tcaaaaggtc 
ccccagcggg 
ccacccagga 
accgcctgcg 
tgctgcacgg 
tcaccatcct 
acctgccctc 
cgcggccgct 
ccaccccccc 
cgtgtccgcc 
accacctgta 
gagatctgcc 



atattagctg 
tctgaaatgg 
taggatctgt 
caatgctgac 
gggtgcctgg 
agaactcacc 
gtcccttgtt 
tcacacagcc 
aggcaaggtt 
caaatggatg 
aaatggaaga 
ggagataggg 
tagagaacgg 
agtggcaacg 
gaggctcaga 
gttagaatca 
tccctaggat 
ggattacccc 
tctctgggtg 
tttgagtggc 
caggccatgg 
ccaggatttg 
aggttgaggg 
tgtagcgaag 
gtggaggctg 
aggctcctgg 
gccacagtca 
tctgagaccc 
gacctcaagg 
tctctttgag 
attagctccc 
ccaggaatcc 
accatagttc 
aggcaggtgg 
tcccctgccg 
agtactttgg 
actgagcccc 
atactgtacc 
acataataag 
ctaaaataac 
agcctgtggg 
cctcatatga 
gtatactcat 
aagttaagaa 
taatggctaa 
tacatgctgc 
ctgagtgact 
ggctggctcc 
cctgccaatg 
caccccaaac 

aggggatcct 

cacagcaaat 
aaccctttct 
gtctgcaccc 
tgacttgttc 
gctgctcaca 
gaaccggctg 
ctacctgttc 
gctcgagttc 
ctgggcctgg 
ggagcgccag 
cgcggcaccc 
cggggtggcc 
tgccgaagac 



135360 
135420 
135480 
135540 
135600 
135660 
135720 
135780 
135840 
135900 
135960 
136020 
136080 
136140 
136200 
136260 
136320 
136380 
136440 
136500 
136560 
136620 
136680 
136740 
136800 
136860 
136920 
136980 
137040 
137100 
137160 
137220 
137280 
137340 
137400 
137460 
137520 
137580 
137640 
137700 
137760 
137820 
137880 
137940 
138000 
138060 
138120 
138180 
138240 
138300 
138360 
138420 
138480 
138540 
138600 
138660 
138720 
138780 
138840 
138900 
138960 
139020 
139080 
139140 
tcgcgggggc gccagggcgg ggacgcgcct actgaggacg actactgggg gggctacggg 139200

ggtgaggacc agcgagggga gcagatgtgc cccggcgctg cctgccaggc gcccccggac 139260 

tcccgaggcc ctgcgctctc ggccgggctc cccagccctc tgctttgcct cctgctcctg 139320 

gtgccccacc acctctgact gcggtgctga gatcgaagag gccagtgtcc gatccccgct 139380 

tcccgtccac ccggggctgc ggctccggcc ccagtcgccc caccttccct ggccttgctg 139440 

cctccctttc ccctcccagc tcctctcctc cccggggagc aggccgcctc tccttgcctg 139500 

ccccctgggc tgtcctgact tgtggcagcc ccaagagggc gtgtgtggtg gctcagccct 139560 

gccctcccca gttctggcca ttaactcttc cccatcccaa ggctggggtg gggcccccca 139620 

ggcagccgct gacccgcact cctaagggcc cacagcggac accagagggg cttttgtctg 139680 

cagagcgtct tccaccagca gagcctttgg aagctccccc agggagcccc acccaggacc 139740 

ctttggggga tgcctcagtc agggccaggc tgaccctgac ccctgcttac cctagtcccc 139800 

tcaacctcct gacactggag gaatactttt ctcctaagtc taccctggac actttttagg 139860 

gcacctggag agaactttcc tctccactgt ggcccctgcg tggtgaagat caaaagaagt 139920 

tgtttgggaa aaaaaattta ttaaaaaatt ctattatttt atctactgta agatttgttg 139980 

acttgggacc ccgaaagcgg gatgaggtct cagaatgtaa ggattgcagg gccaggaggg 140040 

ttggagaagg ggagccgtcc cccgccatca aagagcttcc tggtggctgg aggtggtgtg 140100 

cgctcccccg ccatgaggag gagctgaagc cctgcattct aggtgaggcg cagtgtggca 140160 

gccaagagtg ggtgctggtg gcacctcttc tcttcatttg tccaggggaa gagctgcagc 140220 

caaccctgag tggtctggcg cctgaggaac taagcctggg gaagacctgc tgtctggtta 140280 

acagccctct tccagaccct gttccttcag gaaacaagag cagttctcct gcaaggagga 140340 

gtcacataca cactcctggt cacagacagc cccaacatgg ctttgggtaa atgtgaacaa 140400 

ggcactgctc cctcagggaa acacagcccc atgccagagc aaacacctta gcaaacagag 140460 

accaaggctg ggtttccgcg tacacttgcc tccttggcta agtgcccttg tgcagtgcac 140520 

agcgtacaca cctgcacaca gcaaccctgt gggtatgtgg tctctctctc agctcctgtg 140580 

aggtagaagc catcagggat gaaccaggtc agagaagcag gtttccaaac aggctagaag 140640 

agggaccgag gaactcgggt gatcagaggg acaggaatcc caaattggga tgcattactg 140700 

gcttgaggta caatcagaac cttcatcttt ctggtgtgtg gaagagaggc tggggactgg 140760

gaagagctca ggctaagaag gacttgggtt gggatttagg ggtgagtctc atcagactga 140820 

gcacttggag agaagtttgg tagtttgaat ttggagctaa gaatctagct tgggcagggt 140880 

gtggtcgctt gcacctgtaa tcccagctaa ttgggaggct gacgtgggag gatcacttga 140940 

ggccaagaat ttgagactag cctggacaac atatcgagac tgagtctctt aaaaatgttt 141000 

ttttaagaat ctagtttgga gtggggtgtg atgtctcaac gtctgtaatc ccagcactct 141060 

gggaggctga ggtggacaga tcacttgagg tcaggagttc aagaccagcc tggccaacat 141120 

ggcagaaacc ccgtctctac taaaaattca aaaaaattag ccaggcgtga cggcgggtgc 141180 

ctatagtccc aggtactcag gaggctgagg cacaagaatc actccagcct gggtgacaga 141240 

gactctgtct aaaaaaaaaa aaaatctagc ttgggaggtg ggaatagaaa gatagagggg 141300 

gcctagatgc tagggcttga ggaagcaggc tgaggttctg tgattctggc tagggaggtc 141360 

aaatgatctt gagaagaaga gaagaaagga gaagaaatca gcatctaagc ctgaggcagg 141420 

tagactccgg ttaagggtgt ggggtgggct gggggagagt gagagcagct ggtcagaaac 141480 

ccagggagct cggagtctgg ggtcttgcag gggcttgtgt caggctggct gtgaggaggt 141540 

taatgggttg gattggaggg acagccagac aagagctctg gtggaggagg ggctgctggg 141600 

gcctgggcag ggggagggga gctgctggta aattagaggc aggctgtcca ggtcatagaa 141660 

ttatcattgt gaaatattca tgggccatcg gtccagatgc tatttcagaa cagtgaaagc 141720 

aagaggagtg tgtgagcctc aggaagaagc ctgaagcaaa gccactctcc accaaccccc 141780 

acccctccca ccaccagccc agacagaccc acggacgccc atcacgtgca cacccacact 141840 

cccgagctct cacacacact cgcaccaagc agagccatgt agcacgtgca agcacaccaa 141900 

ccacccacgg gtcccacaaa caggcaggtg tcccctaaat tctgacatgc acactgacat 141960 

gcacacccac tcaatcagga cccagcagag atcacctcca gcgatctcac atgcgcagac 142020 

ccccaaactc tccaaacaac ccagattcac caccttgacc cacacaccct gagataggag 142080 

ggatgttcaa ggccatccag cccaaccccc accaatgctc tgatggggaa actgaggcca 142140 

tagaaaggaa gggatttgtc tgagattcct ctatcccctg aaaaaagcaa aattcattca 142200 

cctcccacat tctgagtgta cccccattct gcattttcgt ctgccagaca cccagcctag 142260 

ttgtaattaa ctcctccctt tctctaattt cctgcatcta ttcagttacc cagtccccca 142320 

cccagccaca gtctatccct tccttcccat tctccccacc acctccctgc tccagctact 142380 

cattacctca tgcctggaat ataaaagaaa actgcgataa cctcctcgct ggtttcctac 142440 

atggaatctc tccctccctc ccacccagcc ataccgtggt gaccagattc atctgatcaa 142500 

aatttgcata tgttatgatg tcactcagga gcctgtaatg gcttcctaat gcctataggg 142560 

taaaggtaaa acaccttagc agagcatcaa agatccctca gagtctggta ccaactgctt 142620 

ttctagcctt ttctctcaca atctcatccc aaaccttcac tccagctaga acgtttgtat 142680 

catactggcc accagttatc atgtatgtga aacccaccaa ccgactttga gtgcccccct 142740 

aaaatttctc agtctctcct gaagtaggaa acctcttccc cctcctcaga tctcagactc 142800 

cagagccctt tcccaaggcc aagactgcac ctctctgacc atatacaggg gttcttcaaa 142860 

gcagcagaca gaggctcagg ctctggctcc ctccaagcag acggctgccc ccgactggcc 142920 

accttgggaa gcacagccag gtttcagtcg tctagaacag agaatgagca tctaaccgcc 142980 
tggggagagg actaggacac cagatgataa ggtttataag cccttaagcc tctaaggttc 143040

ttacacccag agtagggggg ggacggttct cagccctgtt tccctagctg cgggctccca 143100 

attttcgatc cctaatccga gaggaactcc tctccaatga aatacagact tgggactctc 143160 

aggacactgt ggaagggaaa tttcccaaca gactctgaga gtccaggagg ccagggatag 143220 

accaggtggc aggcccaagg tccagctggg gtcaggtttc tatatgaatt tttaatgctt 143280 

ccagatagac ttgtcagatg ttctgaaaac tgagcatctc ctttcacctc tgtacatgat 143340 

gcccttctcc aaccccattg cccctgcagg agggcaggcc tgggacagat attcagtggc 143400 

ctctggagaa acggttttgg gacagtagaa gggtaaatga cctagttatg ttcccactag 143460 

taagctgtgt gaccttgggc aagttactta acctctctga acattagagt tctgtgggtt 143520 

tgtttttgtt ttgtaagctg gggacaatag tgccagccta aatcaatttg ttgtggggac 143580 

tcagtgcaat agcccatggc aaagtgacct acatgcttgc tgttattatt ctctttcctc 143640 

aagttctgcc tccctcttcc agcttttctt ccaaccccaa agatgtctct ggctattgct 143700 

tcgaaggtag gaactttggt tggttctccc ctttctcttc aggcccaaac tccccacctc 143760 

aagatccttt ggcctttgta gaaacttcag gtgaggaggt ggcagagaaa taagaaagtg 143820 

tgcaaggctg gtggagtgag agaggaggat agatggcgaa gccctagcag aggggaggga 143880 

agtgggcagt ggagagagg 143899 



<210> 16 
<211> 215980 
<212> DNA 
<213> Mus sp. 

<220> 

<221> modified_base 

<222> (1001)..(1100)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (2123)..(2222)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (3728)..(3827)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (5168)..(5267)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (7481)..(7580)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base

<222> (8849)..(8948)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (10375)..(10474)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base

<222> (12270)..(12369)

<223> a, t, c, g, other or unknown 

<220> 
<221> modified_base

<222> (13438)..(13537)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base
<222> (15902) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base 

<222> (15939)..(16038)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (18223)..(18322)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (20974)..(21073)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (24403)..(24502)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (27574)..(27673)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base
<222> (30892) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (30901)..(31000)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (34443)..(34542)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (38205)..(38304)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base
<222> (42373) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base
<222> (42386) 

<223> a, t, c, g, other or unknown 
<220>

<221> modified_base 
<222> (42393) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base 
<222> (42461) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base

<222> (44809)..(44908)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (51380)..(51479)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (56740) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (56765)..(56864)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (62818)..(62917)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (68518) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base

<222> (68534)..(68633)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (74552)..(74651)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (81446)..(81545)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (88519)..(88618)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (93791) 

<223> a, t, c, g, other or unknown
<220>

<221> modified_base
<222> (93794) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base
<222> (96565) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base 

<222> (96570)..(96573)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (96579) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base 
<222> (96590) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (96596) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base 
<222> (96602) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (96616) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96629) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base 
<222> (96633) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base
<222> (96668) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (96715)..(96814)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base

<222> (104447)..(104546)

<223> a, t, c, g, other or unknown 
<220>

<221> modified_base
<222> (114521)

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base 

<222> (114527)..(114626)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base

<222> (127063)..(127162)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (139133)..(139232)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (151051) 

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (153242)..(153341)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (164706) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (164708) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (164710)..(164809)

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (182242)..(182341)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (192158) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (192192) 

<223> a, t, c, g, other or unknown
<220> 

<221> modified_base 

<222> (198842)..(198941)

<223> a, t, c, g, other or unknown 
<220>

<221> modified_base 

<222> (199437)..(199438)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (208276) 

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 
<222> (215974) 

<223> a, t, c, g, other or unknown

<220> 

<221> modified_base 

<222> (215976)..(215977)

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base
<222> (215979)

<223> a, t, c, g, other or unknown

<400> 16 

ttgggggtat aaacccagaa gtgggattac tgcaccatac aataatcctc taacttcaag 60 

caatttttcc acaatggttg tatcatttta cattcccact ggctacgaga agggttccca 120 

cttctacaca tcttcaccac catttctgtt tttgtttttg agtaacagct gcctaatgac 180 

tgtgaagtgg tatcttatct cagtgttgat ttgcatttct ctgatcatta atgtgggaag 240 

gcatcgtttc atatgtttat tggctgtttg tgtatcatct tctttggcga tgttgattca 300 

agttatttgc ttgttttttt aattggagtt ttaaaaaatt gttgttgagt tgtgggagtt 360 

cttcattagc tctgcatatt aataccctga tgaaaatgat taacaagtat ttgcttccat 420 

tttgggggct tccattctgg gctgttttta ttcttttgat actcttttga ttctcaacag 480 

tttaatctga ctaaaattca gtttatttct tcttttaatg gccatgctat tgacacatcc 540 

cgtaatcact gccaaatcca gtcatgaaga gtttctttca agagatttat agttttagct 600 

ctttaagttt gtcatgtctg tttcacttaa ttttgtatag tgtacaaaag tctaacttca 660 

ttcttttcta tatggcttgc tactagtata cgaagagcta aatttctctt tccttgagtc 720 

tcaacctctg atgtgtagca atttcttcag aggaaaacat ggtgggaagt tccttaaaca 780 

taggatgctc catggaggtg aaatagttca tcctacaggg aagcttgtta aacacaggaa 840 

gtacatactc agcagctcta gtaagtgagt gaaactgact ggaggcacta ggtccctcct 900 

tccctacgca tatagaagct gtaaggattg ggaagagata ctgtcaggtc agctcagctg 960 

ctgcccggaa gaagctcaga cccactggcc tggctccaag nnnnnnnnnn nnnnnnnnnn 1020 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 

nnnnnnnnnn nnnnnnnnnn atcactcttt actcaggcca cctacacgct gtttatagcc 1140 

tgcctttgtc tctttggcta tacttcctgt ttatgtctat gcctcccctc tttctttttc 1200 

tttctcttct cttctcatct catctcatct ttcttcaggg gggagcctgg tctagaactc 1260 

acaaagattt gactgtctct gtctccttgc actaattaaa aaatctttta caagcatctt 1320 

ttagcaattc ttacagggaa attttggaat gttaaactct gattgttagc gggctgaaga 1380 

taacaatagc tctgatgata aattgcttgc caggcaagtg tgaaaatctg agtttgatcc 1440 

aaaaagccgg gtacagaggc caaagagtcc ataatcctag taggggcagg aatcagggat 1500 

gggtgggtcc ctggggtttc ctggtttgtc agcgtagccc aattgggaat agccaggttt 1560 

cagtgaacga tgctttctgc aagctgagag aggtccttgt tcaatctctg tgacccaact 1620 

ggagggagaa gagagccagc tctccagaag tggtcctctc aactttgtgc atgcatgtcc 1680 

atgttcacac agggaatgga taatgcttaa aaggaagacc ggcagggggt tggtaatgca 1740 

cctcctttgg tgacatgctt tcctcttgtt catgctgctc caggtgtggt cggcagcacc 1800 

aaaaaccagg tgtatgtttg taatcccagt attctctggt cgtcagtagg aaatgaaaag 1860 

cgaggtcatc ttcgtataga gttagcaaac tctaagccag cctcggctac atgagacttt 1920 

gtctcaaaac aaaggaaaaa tcaaggagga cggctcccga gcactgtcac ctgaagctga 1980 

cctctggcct ccacatgcat gtgcgcaaac acatgtcctg cacaaacaca cagacacccg 2040 

catctgctcc ccgacaaaag aacctgaaac cagtatactt tgagaatttc ccattcatag 2100 

ttaccattgt gtgttccttg tgnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220 

nnggtggtgc ctttctctta cccagtctag aagggctgga ggcagggtgg atggggcact 2280 
ttgaactccc acctaggcaa aaaacccagt gatctctggg ccagtgtgtt gtttgcaagg 2340

gaataaggta gagagccgcg gaggaagaca ttgggggttc tatgagtatg tgaaggggtg 2400 

cacacaccac acacacacat tttttttgtt ttaaatttac aaacattaaa ataggctgta 2460 

atgtggctca gtgggtagaa aaacctgctg tctaagcctg gtacgagttc aatccctgac 2520 

aagctggaag gagacaacca accacaactg ctagcagcca gaagcactgc ttgctaacac 2580 

tcaagagagc ctggagtgga agacactgga tccccagcag gcaagcctgc aagaagatgt 2640 

gccttgccta gacaacggca gaacaaacat caaggctggc agagctgtcc aggactgttc 2700 

atattaatca tgtatagata agagggaatg gcacagacag aacaattcaa cacacggggt 2760 

atgaaaggaa aggaacaagg cacacaaagg acaaagaacc tagcatacaa gaaagcctaa 2820 

gcagagagtg gcacttccca gaagggagtc ataaaataga ctgaattcat taaaacaaga 2880 

gccaaagata aacggctcaa aaaactcacg gaaaacaggt caaaataacg tcacccatct 2940 

gacagttgat actgtcaact taaccgtatc tagaactcca gcaggcacat ctccaggcat 3000 

gcccctgaag gggtctttgg actaggttaa ctgacgtggg agtgacacca tctatggacc 3060 

gaagcctcag acagaataaa aaggagccag tgagctgagc gtcagtgctc attgcttctg 3120 

gcttcctgtc tgtggctgca gcgagacacg gtgcttcctg ctttagctgc catgacagac 3180 

cacaccctca aaccgtgaac caaaataacc tcctctctac attgctttta ccaggcattt 3240 

ggtcacacca atgagaaagg ttaactaata cagcactcaa tacttaaaaa cataaacacc 3300 

aaccttgttt gcatgtgtga gactttgaag ctcacgggcc agttatgccc aatgccaggt 3360 

ctgctggcta agggtgagag tgcacaccta taatcccagc tgctgtggaa tcagcaaaag 3420 

cgctacagat ggaaggcagc cagggcagct gagactgact caaactgata gaggtgggag 3480 

gcatagagaa aaccagatta atagagtgtt ccccactatg caagaagccc tgggtttcag 3540 

gacgagagaa ctaagaatac agaagtctac tgtgtagaag cactgctagg tcacacagaa 3600 

acatcactca agtgtctctg gatgctacac ggagggcgtg tgaagtattg cttcctgatg 3660 

atctgtatct actacagcac tgctgtttta gtatgcgctc ctccactaca gctcctcacc 3720 

acaccaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3780

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaat taatcaaaga 3840 

aaacacacac caccagttag agaaagttaa tcaggccgaa tggcggcttt cccctgtatc 3900

caggctaccg tcaggacggc tcactgccac tggcaactct gcctgaacaa agcccgcagc 3960 

caacgtgggc ttcaggggct ctaaacactg caatcaaagg ttgtgtgtgg gggtgggggt 4020 

gctgctgcta ttcaaggatt cccaaagctt agatgtattc atcatactca caggaaagcg 4080 

tgttcaaccc atcactcatg agcagtcggt accggggtga cctattccct gtagaaatgg 4140 

gacggatgtt ctggaaaagt tgacagaaaa gttgattcat taggcaggct ctttgcccaa 4200 

gccctgaggg taagcaaagc taactggcag gagactaggt ttgccattaa tctgagacaa 4260 

gatgaaccac ttgcccatcc tcctgacacc taaatactaa tgaaagaaca atggattgag 4320 

ctggcattat taaaaacgat agaaacagaa gtatcaatag tcatgtgttc tttctcccat 4380 

atgtcaaaac aatgtgtaag atggcatcga acacatgcag aaactgttta gggaacatgc 4440 

tgaaaatatg aagtaaaatt aaaattggaa agaaagacaa tttgcctaaa gcagctcaga 4500 

gctggagaag ggaccgaggc agagataaca gcaacgtgtg gacatacgga tctggggcag 4560 

agcagtcacg gactcagccg gaaagggtgg ggcagcctct gaaggaagtt aaggtaaata 4620 

gagccacaag gtgattggcc caggagtggt gccaccttca cctcctgcct caaagtctga 4680 

aggaatgatc ctggagtctc ccatctattg atatatgaaa ttcacagtat gttttagaac 4740 

ccactgaatg atgggtagat taactaaaag aaatttaagc ggggtggtgc aggtctttta 4800 

atcccagcac ttgggaggca gaggcaggtg gatctctgtg agtt5gaggc cagcctggtt 4860 

ccaggacagc cagagataca tagagaaacc ctgactcgaa aaaacaaaat taaaagctca 4920 

tcaaaacaac aacaacaaca aaaaaaacaa aaaaacaaaa caaaacaaca aaacacccta 4980 

tagtacctgt tggtgagttt gagtgagtga gtgagtgtgt gttagagaga ggggcgggga 5040 

aagtgtgttc tggaaatggg agaaagagaa tgtgcatgtg tgtttctggg atgtagacaa 5100 

aactacatgt cttccatcaa atgcaatgtt taattatcta tgagttgaac catcttcatt 5160 

ctgctaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5220

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaac aaaaataaac 5280 

caaaccagta aacaaaatcc tgtaagataa agcctaagac aagacacttc ctggggctgg 5340 

ggagttgctt agaccataag gagttcataa tccaggcgtg agagcccgag ttcaggtccc 5400 

tgggcttcca agtcaggagg agaccaagga atcaacaagt ctcgactttg gtctctagtc 5460 

ccatgcacac acacgcgtgt aaatacgtag atgttcactc acacacagaa gactgcacct 5520 

ggctctctca catctcagcc aacatataaa gcctgcatta tcagaacatt ctaggttcta 5580 

gtttcagtca actcttacac agaatggcca tcatactccg tctacaactt ctcctgatct 5640 

acccacgtgt cattgcttca gtattaacaa aacccagaat aaccagctgc gtagatcctc 5700 

cctgatgccc cagtcattgt cttactgaga ctactaagtc acaaggtagc actctggatc 5760 

caaaaagcaa tatccaattg agagttacaa cctataagga ggagtttacc ttcattatag 5820 

ggcactggat tcccaatctt taatccaacg tcttcagcag atttcataac ttccaagtcc 5880 

atcaaaacaa ctactttcct acaaagacag acacaagtta gaattaagaa ctctgcagcc 5940 

tttcagatga gttactaaga agcttacttt agtagttgtc tggctaaaac tgtatccttt 6000 

accaaccttt tctcattctg gactaacttg agaagtatta attcctaagt aaatacttca 6060 

cttattcttt ccccacatct ccaatgtttt tgtctttaat ttattatagg gcaattcatt 6120 
tcctatctag ttccctgatt aaaacagtag accttgctgc atgccattat cctcatggag 6180

gcactgatac aatttagatt attaaataca aaaccctaaa acacaaaaag atgatttttt 6240 

tttaaaacaa gattttaaaa aaagcatgtg ctacgcttcc ttctgccact aagcctacac 6300 

atggtcctct gactgaattt ttcccctcat tctgcttcat ctaatatgtg cttttcaaac 6360 

ctggaattga accagggact tattcatgct aggcaaatgc tctaccatag agctataccc 6420 

ctccaactcc catctcaaat atcatttcca aagacatttt cttggtctct tatttagatc 6480 

aggtttcttt gtcctcctgc agctatgact tcattccttc agaacactcg tcttagcttt 6540 

aagttctgta ttaattagtg attgttttca ttctctctgc tagaatgcac tttcaataaa 6600 

ggcaggtagc cagccacagt gcttaattaa gcaacagccc aacgatgtca ttcactacat 6660 

actgggacaa gatgcctaac atcatctgca gataaagacg aactaptggt gtcaggagac 6720 

agctaagggg tccagggctt gggcacgctg agtgtgagca ctggagtccg ggtgcccaga 6780 

aacgcacata aatgcaatat ggatgtggca atctacctct aattccttct ttaagacagt 6840 

ggctctccag agcaagctgg ctagcaagac aagccatatc agtgagctct gggcttgacc 6900 

aagaccctgc ctccaggtgt aactcccaag caaaaggatg atggctcaca aatctcaggc 6960 

tatcatgttc atgtacaaaa tgtcaaccgg catacacaca tgcacacaca tgaaaactgg 7020 

gagaaaataa gaagaattgc aaccaaaaaa tgtaatttga ggacacataa ttgcaggcgg 7080 

ggagtggggg gatgacagaa ggtgaactga gtggaccgag ggaaagctgt gctagcggca 7140 

atgagaagaa gggtggggca gtctgagcaa gggttcagca atcaccacgc tttactgtct 7200 

gcacagcctg gctgtagaat gctgggcttt atcacacaga attattcagt atgtgctatc 7260 

tttacagtaa agttattcta tcaggctatg ctacttcaat agaacaagcc tgaaaaagtg 7320 

gtctgctgct gagaacctga caaagatgac ctgttagaac tgtctgccaa gtgtggaatt 7380 

ccagcactgg ggaccaggag ctcgagggtc accccagatg cagggagtta gaggccagtc 7440 

ttggcaacat aacatcatgc ttcagaaatt aaaaacaaaa nnnnnnnnnn nnnnnnnnnn 7500

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7560 

nnnnnnnnnn nnnnnnnnnn catgagatag ttaataaact gaagaaagcc atacaaggag 7620 

taaagtagat agttgcaagc atgaagaaag acaaaccact tgagcttttc ttttgtcgta 7680 

aggaggaaac cagacaggtc cagagagatg gctcagagat taagagcact gactgctctt 7740

ccgaaggtcc tgagttcaaa tcccagtaac cacatggtgg ctcacaacca tctgtacagc 7800 

tacggtgtac tcatatacat taaataaata aataaataaa taaataaata aatcttaaaa 7860 

gaaaaaaaaa aaaaacctaa ccaatcagcc aggcgatggt gacacatgtc tttaatccca 7920 

gcacttggga ggcagagaca ggtggatttc tgagttcgag gccagcctgt tcttcagagt 7980 

gagttccagg acagccaggg tgatacagag aaaccctgtc tcaaaaaaca aacaaacaaa 8040 

caaacaaaca aacaaaaaag gaggaagcca gacaggatgc actttatacg tgaatggaat 8100 

tgacaaaaga caagttctat aagtgttagg gaaaggggga ggacaacggg ggttcatgtc 8160 

tgtggtggaa cacgtattag aaggctctgg gtatcctgtt tccgacaaac aggcactccc 8220 

aatcacacag gccactggat gtctcaggca gagaaagatg tgatagattg actttttaac 8280 

aatcacagac tgtgtggaaa atatttgtaa ggttgtcatt gtcacccagg atagagctga 8340 

tggttattca aacgaggatg ggacaacaga aatgggagag agggatgtga gaaccatttt 8400 

gaaccagggt gatttactgc gcacgtgtat agggtctaca gggagtggga tatgtagagg 8460 

aggcctatgt tcctaacttt ggtaatgagc ttattacagt tactatgcac agcctggaag 8520 

atactggaaa aggtgcaggc taggctagaa aggtactaac tgagggtttg acagcccctt 8580 

ggatgtcagg atgcagcaag cctacctctg tatgtagtca atcccttctc aggctatggg 8640 

tcctgcagat catccgtctc tgtatccatt attcccagtc catcctctga gtggctccct 8700 

cttatccagt ttaacaaaat gctgactgca agctcccaag cccagggctc tggctccttt 8760 

actccttgtt attgtacttt accctgtttg cttgggatag agtgtgccct ttataaacat 8820 

ttgtgaaagg gggaatgaag aagaataann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8940 

nnnnnnnnag agagctcaat ggttaggagc actggatgct cttccaaagg tctccagttc 9000 

aattcccagc atcaccatgg cagctcacaa ctgtctgcaa ttccagttcc aggggattca 9060 

acactcagaa acataagtgt aggcaatcta cgtaacataa aaataaataa atgagctgga 9120 

aaagaaaaca tgtttcaaaa tatacaagta atggggctgg aggagatgtc tcaatgggta 9180 

agatcattgg ctgctctttt ggaggttctg ggttcaattc ccaccaccca catgacagct 9240 

cacaactgtc tgtaactttg gtcctgtggg agctgatgcc ctcttctggt gtgcagacat 9300 

acatgtagac aaaacacctg catacataaa ataagttttt aaaaaagtta cacatacacc 9360 

cgtgtgtaat ataacacaca ctggcttaac ttcctcagca ctgactgttc accatacgga 9420 

ttcccatgag gttttggttg cattctatca ccgaaaaaaa aaaaaaaaaa ttagaagaaa 9480 

gtatatacat ataaacctct ccctaaaata aagttttctt ttctaaaagt acatccttat 9540 

ttttttattt tttttttttt ttaagaaatg ggaacaacag ttctgctcac actgtatttc 9600 

tagcatgtaa catcttgcaa gtacttaacc gtattctata tcagctcaac acacttacta 9660 

ccgaagactc aagatcacaa aaaaaaaaaa aggacccaga ctggataatt aaacgtttct 9720 

tttgttgtag taagcgacct cttccttaga agatactaca gtaatgctga agaaatgaca 9780 

catctactgt aatctgttct ctgggattcc aacttgtttc ctctgctact cctcccttgg 9840 

cggcaatgtt cgtctgcatc cggctgagct cctcgctgcc ttgttaaacc tccttcctga 9900 

acttccgacc tgtagttccc gctctacagt gcaagcgagt ggataaggaa gcgcatacct 9960 
gccgtctttc agggtgttga cgatgaactt gtggacctgg cagacacagt tgctggccag 10020

ctgccctccc tcgaccaggg tgttcagctg cgtggccagc atgaacgctg caaaagcaga 10080 

gagagagggg ctcagtctcc aagcctttcc ttaacccgaa agctcatcac aaggagaacc 10140 

attaaataca gctgtttaaa actcctccgc cctgcagaga ggaaagcagc atcaatccgc 10200 

cccatgtaaa agtctgaggc tcttcctaaa tggtatctgt ttctcacagt ctccaaatca 10260 

tttttactgt aattctagtt tctggggaaa gacctttctc ggtctttagc cccgtgacta 10320 

gagacaacag gcaaatattc cagaaaggcc cccattttct ttttaaagct tctannnnnn 10380 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10440

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngcacat cttgtgaagt gtccacatct 10500 

ttcggtccct cgaatttggg tttcttctgg gacgtggtag catgtgactg tcactccagt 10560 

gcttggagca gcagaggggt caggaactcc aggctggcat tagctgcaga gctggagcag 10620 

gtcctggaga acagaaactt tggttgcagc attaatgaac tagaagaatt tttttgtctt 10680 

ctgttaaata taaatacctc cattatcttc tcataaacag tgttgccttt ttatttaagt 10740 

ttttaaggat caggcacaga gactccatgc cagactacca ctcaaccact gagctacacc 10800 

cccaacttgc ctttctgcta ttttttaaat tgtatcagtg gccaccaaac atggggagag 10860 

gtcagggggc tacgtggagg aattgtttct ctcctaccaa gtgggcccca ggtttcaaat 10920 

tcaggtgacc tggcttggca gcaagcacct ttacccctaa gccatctcat tggcttcatc 10980 

ttttaatggc cccttcccct gctctgaggc aggctctccc tatatagccc tggctggcct 11040 

caggctcgca ggtccaccag tgagcaccag gtttctgctt gtccttacct ccccagcact 11100 

gtggttataa gcatgtgcca ctgtgtcaaa ctcagtcact aagctttgcc aagccatagc 11160 

ccagcccttg agtttactgt ttgtctgtgt ggtgatttgt caaaccactt ttgttccact 11220 

gaggtatttt gtcaagtttg acaaaattag ttgagtatgt aggtcttttt ttctggaatc 11280 

ttctgttata gcttagtctg gtcttgaact catgatcttg cctcaacctc acgattattg 11340 

aggattattg agatggacag gctgtgtgac catgctcggc tgtgtgtttt agcatgcatt 11400 

agtcatttga aaaacgttgg ctcatgacac tttacaggtc ttccatgttt gatatgtttt 11460 

atttaatcca aagtaattcc agcaccagag gctgagacag gaggatctca aggtcaacct 11520 

agagatgcat agcaggcggg gccccactcg gttaggttaa tatcatcact gacttcagga 11580

gaaaagtctt aagtattggg gactaaaagc aggaggatct gaagttcaag gtcatcttta 11640 

ggaacttagc agacttgagg ccagcttggg cgctgtggga ccctgttttt aaaccagaaa 11700 

acaaattgaa aggaaaaaaa aaaaaagctg gaggaagtga atgtgagtgt tcacatagtc 11760 

ctgtttccac aagaaaacag ggttactttt ggcaacaaat aggtgctttc tttgaaggct 11820 

ggcatttttg tgacttgtca ttggagaaat gatttaatta agacttttct actgagtgcc 11880 

tctgaagagg ctcttttaaa tttagtttaa ttttatctca ttgttagtgt ggtgtgcttg 11940 

tgcacacaga aggcagcttt ctagagtctt ttcactctct cctccacagc tcctggagtc 12000 

aaactcaggc cctggctagg caagctctta ggacagtgtt agctgtagct tattaagttt 12060 

ttaagaattt ttataagact ctgtttttct ttctcaggtc atgatacagc aggaaaatac 12120 

atccataaag cccatcctgc aggtcattgt aagtaccggc atgtgtgttt agcataatga 12180 

agatggttca cttatagtta attaaacatt ggattggatg gaagacatgt agttttggtt 12240 

acttcccaga aacacaaatg cacattcttn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12300 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12360 

nnnnnnnnng aattcagagc tgatatgtag tactaactcc tactcaatga atcctttgtt 12420 

cttctattcc ttcattacat tactgttaat agtggtaact atgtaccaaa gagtcaaata 12480 

actcttggac catccaaggc agaaggaagg ctggcaaaaa tgtatgatga tctgggatgg 12540 

gaatgtactt cagtttgtac aggaggccct tggttcattc catttctggc aatgcataga 12600 

cctgtaggat ctcagcactg gtggggggtg gggggtgagg gtgaaggggc gggaggttaa 12660 

aggcagaata gtcataaatt caaagtctgg gtcctggaaa gaggactaaa cgattaagag 12720 

ctttagctgt tcttctagag aacctggtgt gatccccagc acatggtgcc tcacgactgt 12780 

ccgaaactct gattctaggg ggatctgaaa accctcttct gccctctgta gatacagaac 12840 

acacatggtg cacatacata catgcaaccc aaacaaccca tatacataaa atattttttt 12900 

ttcaaaaaga cattcaaatt cttcctcggc tatatagtgt ttaccaaacc tcaaaaacaa 12960 

aacaaaacaa aacaaaacaa agaatcatta atgttttgcc ttcatgtatg tctgcccacc 13020 

acggacatgc ctggtaccca gggagattaa aagaagacat tagctcccct ggaatggaga 13080 

taggtatgat ctaccacttg ggtgctggga acctgggtcc cctgcaaaag cagtaaatct 13140 

ttttaacccc taagctgtct ctcccaacgc ctaaagattc ttgtaacaca gcatgatgag 13200 

cactggcaag catagcatgg taatctgact tcagggcgcc agattttgag cttaatgctt 13260 

gattattaga agtaacgtac tagatttaat gcctggagct tcaagcaaca aaattaactg 13320 

aagaataaaa ataaaaaccc tgccagccat gatggtaatc ccagaacttg agaggcagag 13380 

gcaggtgatc tctgtgtttt gcaaggccag ccacaatcta catagcacgt tgcagtannn 13440 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13500 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnncga ataaatctac acatgtaaaa 13560 

agaaattcaa agaaacaaat gccaaataaa tacacatatt gtaataaaga gataattgtc 13620 

taaaaaactc aaggctttaa atggtaagat atcatattct tggatgaaaa gatctaatgt 13680 

caaaatatat caatttaatg caattatgta tattcaggag atctctggtt ggcttttgaa 13740 

cttgatagca ctcttataat tcacatagaa gaaaaaaaac catgaaaact gccaaacatt 13800 
attagaatac tccacagatg gtattttggc agcacataca tcgaagggct gtgaaagatg 13860

tgtagatcat ccacgccttg ctagggagag ggcgggtgtg tgtggggggt atagctgttt 13920 

gggaaaataa cctggtaatt cctcattagt taaatcatag tcagaacctg gactagcaac 13980 

ttctctctaa aatacattca ccctcagcat ctgcattgcc aggaaaccac tcctagcagg 14040 

atctgtacgt ggatcaaggt agtagcatct gcatttaatt gacattctcc taaatgcttt 14100 

aaattatctc tagattactt atagtagcca agatgatgca aattatgtta cactgtatta 14160 

tctggggcgt aacaagaaaa tgtctctact caggttcatt caggtgcagt acttcccctg 14220 

aatacttctg aatacacgga tcaagaagcc acagaaagag ggctaaccat atacaagcat 14280 

atagtacact aataaccatg tacaaccata tagtacacta atattcagtg cattactcaa 14340 

aatgcaaaca gatggaaaca atccaacagc ctgtaagctg aaaaacaaga taagcaaaat 14400 

gtgctgggcc tagaggccca ggtctataat tccaactaag gtcgaggcag gaggatctca 14460 

agttcaaggc cagcctagac aacttagcaa gaccttgtct caaaacaaaa agtaaagagg 14520 

ctgaggatat agctcagtat agagcatctg cttagcatgt gcactgacag ccgtatcaca 14580 

gaggaaaaaa aaaaataagc aaaatgtgat ctgtctgcac aacaggatat cacagccccc 14640 

taccacaggg gaacgacaca gtaacacaac aaaaacttag ccctgaaaat actatggtaa 14700 

ataaagaagt gtcactgagg atcaggaaat gcatgactcc atttacatta tatagaaatg 14760 

agaagatcag tgagcctcta ggactcaaga gatttgggat tggcagctaa agggtactgg 14820 

gtttctttat gggggtaaga aaacattcta aacttaactg tgagaatgac tactcaacaa 14880 

tgtcaagtgt tcaaaaatca tacttttttt tttttttggt ttttcaagac agggtttctc 14940 

tgtgcagtcc tggaactcac tctgtagacc aggctggcct cgaattcaga gattcacctg 15000 

cctctgcctc ccaagtgctg ggattacagg catgcgccac cattgtccgg ctcaaaatca 15060 

tacttttaaa aattgcccag tgactcatga atacaatcag aggcgggaga ggacagtggc 15120 

aaactcagga taccagtgtc ttttatgtct gctgcccaac tatcaatttc ccatagttac 15180 

cagagaactt tttggtttgt ttcatcttat ttgttgcttt tggtagaatc tcaatatagt 15240 

aagatacaag gctggcctca tactatatag ctgaggacga ctttgaactt ctaatcctcc 15300 

tgcttccatc tcccaagtgg tgggattaca ggggtgtacc gctatgccca gcaagcacaa 15360 

agccatttga accacacccd agccttttca gagaaacctg tacaagcctt agtgccttag 15420 

catattaagg caacaaaaga cataatgcgt ggctaccata gagtgtttgc ctaccatgtg 15480 

tgaggctcta ggctaaatgt ccagcactta taaaaaagag ttaaaaacac tcatgactca 15540 

aggatgacta tgcagtcttg tgtacaaagc cccgcattca atccccagca ccgtgcacat 15600 

caggcaggct ctgtagagga cccagcttaa ggtcatcctt aggtaagtta gaggccttag 15660 

atggctacat tagatgagac cctttctcat aaacagaata aataatttaa agctcctgat 15720 

caaacactat gccttcccat cacactcaga ataaagcact ctactggccc tttaaggact 15780 

gcccatctgg aagagaaacc taagttacat tccttgcttg tgtcatatgt gataacaaac 15840 

tcactggaaa tacgaaaata cagtcttaag cttggtcaga aagcttcccc agcaacatga 15900 

tntcagagga cataatgcag aaagtggaca aatgcaaann nnnnnnnnnn nnnnnnnnnn 15960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16020 

nnnnnnnnnn nnnnnnnnaa tcagaggaca tctttcagga gttagttctt tcctccctct 16080 

atagctttca gggatcaaac tcaagtgtgt actgagcgct tatgcccagt gcgccatcgc 16140 

accaggcctg cttctttgtt ttttatgggt ctgaatcaat tagcaccatt acaacaatgt 16200 

tgacaatcag caagtacctt tctctacctg gctagtaaga gaagtaagtg cctttggtgt 16260 

gtgaacgcag tttctcttgt gaagtgcatg gacttgatct ttgctcacaa cgttttttag 16320 

gtccttaagt tgcttgggtt ttatgggaaa ggctcttggg ttttttgaaa agattttact 16380 

acaacttgat ataatcatta tttttaatcc tttaaatagt atgacttatt ttaacagatt 16440 

aatattgaac tgttctttca ttcttacata ataaatcctg ccttaaaaaa taatcctctt 16500 

agcttccttt ctctattttc aaatttgttt tatatttttg catgattttg aacatttata 16560 

aaagtaggca gacaacacag tagaaccaag tccccatata gctgtgcaca tagcttcaga 16620 

ttattgcctg ataataggtc ctgttttgtc tctgttttct cacagggatg tgttattgtg 16680 

tgtgtgtaca catacataca tatatgtatg tatgtatgta tgtatgtatg taatgaactc 16740 

cctttaacaa aacaagtact gggctgggaa gacagcacag ttagttatgt gtttaaccgc 16800 

acaagcatga caaccagagt tgagatcccc accaaccgca taaaaagctg ggcatagtgg 16860 

cattgacctg tagccctggt gctggatgaa agctggggag gcaggtagat cggcagagct 16920 

tactggcaac aaatctgccc agtaggtaag ctctgggctc agacatccta tataggaaaa 16980 

agatgaaggg cgaggcgcag cggcacacac ctttcgtggt agtgcttgag aggcaggggc 17040 

aggccagtct ctgtgaccag cagcctggcc tacatgtcaa gttgcaggac agccagagcc 17100 

accacctact gagactgtct cagaaataag ttttttaaaa aattgagatg aaggagctgg 17160 

taagatggct tagaaggtaa aggcacttat cactaagcct gaagccccga gtttgaccct 17220 

ggaccccaca ctgtagaacc aactcctcca agttcttctc agacctccag cagagcacaa 17280 

gtgtatgcag acacacacac taagtaagtg aatgtaaaaa acatgacgta gtggcactgg 17340 

cctttaaacc cagcattggg aggaagaagc gggtggatct cttgagtttg agaccagctt 17400 

agcctacata aggaatttca aggcagccag ggctacctag aaagtagctg tttatgaatg 17460 

aatgaataga aggaaggaag aaagagagac agacttaaaa aatatatgct ggagagtaac 17520 

agaagaggac accggcttgc tggtgtcttg acctctggct tgtacacata cacatgtgta 17580 

gtgcatacac ccacatacaa ttgtactcag acacacacaa acatgtactc attcatatac 17640 
tgcacacctc aacactcaga aaatgaaaaa acaggtacca tttacacctc cgtgttcggt 17700

ttccaaccac tcatatgtat gggttgtaaa tgcttatatc tgtatgtgtc tgtatatttg 17760 

tgtatacatt caaagttgag tcaggatcca acgtaaactt ggatagtagt gggttgatgg 17820 

tctggaagcc tgctcgcagc tgtctttttc tcctcgtacc ttttcccctg tttgtttcta 17880 

cgacagcagg tcatttgtct ctaagtgtta gtttcccatc ctctctcttt tgctgatggt 17940 

agccttgtag tagtcacctg tgttctctgt aaaatggctt tgccgtgtta tttcaatatg 18000 

ctatcatcct catcttgcta tatttcattc aatatatgta tatattacaa gatagattaa 18060 

aattatttta attttatgct tatgaatgtt ttgcctaagt atattgcacc ttgtgtgtct 18120 

agtgtccaca gaactcagaa gaaagtgtca catattctgg aactggaatt gcaggtggtt 18180 

gtaagccacc atgtgggacc tggaaaccaa atccaggcgc cannnnnnnn nnnnnnnnnn 18240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18300 

nnnnnnnnnn nnnnnnnnnn nncctttggt caaagatcct cagtttcgac tttgattacc 18360 

cagacttcct gtttctctca tggaacagtt tccccctgag atttactagt ggaagaaagg 18420 

cactcaaaaa gcagggagcc ctcgtacaaa tgagacttcc tagctatata attaggccga 18480 

gatgcacaca tccacagtca ttacccttct tcagagcctt tgtcatgtca agtgtatttc 18540 

gcccatgtga actttagaac tggcttgttg tgtttcataa aaagtgtagt tgggctcttg 18600 

attgggattg tgttaaattt atagattcat ttagggagag ctgacaactt tacagtatta 18660 

aatgttttca tccgcggaaa aggttgtctt gccacttact tgggctttct tttatgccct 18720 

taagtaaagg tttatagttt tctttatatc agtcttgcac atttcctgtt agatttattt 18780 

ttgcttgtaa ggtttccttg gttggtattg tgtataaaca ttttctccca attacatacc 18840 

ataattgatg gttaggagta taaataaaag gtagaatttt aaaattgctg tatgaatgac 18900 

tgctttgcta taattcaaca tcttttcatt tttatgccaa tctacttgac atgtttaggt 18960 

taaatgatga tatctgtaca gagtaatcct ctgatccagt atttgcacat cttactttct 19020 

aacgtccata gcatagatac acatcttata ctgttgagta catatatatt taaggtattt 19080 

accatagtct tataatatgc agcgtgcttt ggttcaagac agttgccctg tgttcctcaa 19140 

cattaacatt tttttcatca caaatacaca ttgaccttta tcaaattttt aaaactatct 19200 

tgagagaaat gaccattttt cttaatctgt taatgtaaaa tttttataaa aatagttata 19260

aataatatta gcctacatat ttcttctgtt ctctttttca actcttagaa tcagagtatg 19320 

gtagtctcag actaaaccag gagcttccta tctgtttctc tgttcttaag tcacttatat 19380 

aatgtaagga tgctgtgtat atctgccagc taggccttat atacaaaagg cacccatcac 19440 

aaccttctaa aacagtctta ccacttagag accatgttca aacatatggg cctttgaggt 19500 

aattgccaca ttcaagctat aatattgtta tctaagggaa tatctfccact tctagcagat 19560 

gcctaaaaat atctaaaggt aaacactggt aattgctgtg tttgttgatg ctgctcttcc 19620 

tcctcctcct cctcctcttc ctcctcttct tcctcctcct cttcttcctc ctcctgcttc 19680 

tccttttctt catcctcctt tcttttctta tttttgaggc atgatttcac catgtagccc 19740 

taggtaaccc gtaacttact atgtatgtag accaggctag cctctgtctc ctgagtgctc 19800 

atattaaagg tgtgtatcac catatccagc aacacttgct ttgagatggt tagaggaaaa 19860 

aaaaatatac gtaaataaag atggatgcca attactaaat tgttacttcc agtcaaactt 19920 

tgtacctagt ctaaggccaa aatagggatt ttttttctac tttgcaagtt ggctccatta 19980 

agaggctttt cttctcttgg tctcactaga taggaaggag agagaggagg gaaggagaga 20040 

aagcggttga ggagtgggag gtagtgtgac cgagaatacc cagtaggctc atatatttaa 20100 

atatttggtc cctagttgat agaactgttt agaaagatta ggaagcatgt cttaggggct 20160 

ttgaggtttc aaaatttaat gctagaccca gtctttcaag ggagggggcg gtctgtctct 20220 

ctctgcctgc tgcatgcaga gctctcagct actactctag tgtcaagcct gtgtgcttcc 20280 

tgcctcaatg atcataaatt aactgtaagc aagcctccaa ttaaatgctt tcttttatag 20340 

ttaccgtgat catggtgtct cttcacagaa atagtaacct gtggtgattt taatatgcct 20400 

ggaccaggga gtggcacttt taggaggaat ggccttgtta agaggaagtg tgtctctgtg 20460 

ggggtgggca atgagaccct cgtcctaacc atgtgagaac cactcttctc ctattggcct 20520 

tcagatgaag atgtagaact ctcagatcca cctgcaccat gtctgcctgg aagctgcctt 20580 

tgttcccacc ttgctgcccc aattaaatgt tgtacttata agaattgttt ttggggggct 20640 

ggagagatgg ctcagcagtt aagagtactg actgctcttc cagaggtcct gagttcaatt 20700 

cccagcaacc acatggtggc tcacaaccat ctgtaatggg atctgatgcc accttctggt 20760 

gtgtcagaag acaggacagt atacccacat acattaaata aataaataaa taaataaata 20820 

aattcttttt aaaaaagaat tgctttggtc atggtgtctg ttcacagcag taaaacccta 20880

acataaccct gactaagaca acaagtgagg aaaggtgttg tgtgacactc tggatctctg 20940 

gaagctcacc tcagcatgaa gcttgtcgaa gcgnnnnnnn nnnnnnnnnn nnnnnnnnnn 21000 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 21060 

nnnnnnnnnn nnnatctaag tacactgtac tgtcttcaga cacaccagaa gagggtgtca 21120 

gatctcatga cagaggttgt gaactcagac ctttggaaga gcaatcagtg ctcttaactg 21180 

ctgagcatct ctccagccca aaataattct tactagtaac atggaacaat caagttttat 21240 

tatatgatac atattaatca acttataagt acatgattat gcacatttat catatcgtgc 21300 

aaccatcact gctgtcgttt tgttttgttt tgttcttttg aggcccggtt tctgtgttgt 21360 

tctggaactc actctgtaga ccaggctggt cttgaactca atgatctgcc tgcctctgcc 21420 

tcccaagtgc tgaaaacaaa tgtgtgcacc accacctctg gctatcactg ctgtcttttt 21480 
ttttttttta acagttattt atttcgtgca tgcatgtgtg tataagcatg taacgtatgc 21540

catggtatgc atgtggaggt cagaggacaa ctttcaggag ttagttcttt cctcccactg 21600 

tgggttctag gaaccaagct caggttgtta gacttgcatg gcaagtgcct ttaccacaga 21660 

gccatcctgc tggccctact ataggtcctt atataaaaag atcatatgcc gggcaaaaac 21720 

caaacaaaaa ataaacctca aaaaacaaaa ggaccatata atattgtggg ggagtggatg 21780 

aagtcctgaa cgaatgtgtt ctgttgacat gtctgtactt cagacccatg ggaattggca 21840 

aagccttcct ctggtcctgt gaggatgctg atagtctgtc taaaaactag agatcacagc 21900 

tttctcctct ggatgactgt aaccccagat tgttcctctt cagagactgt ccaccaagct 21960 

accctgccta cttaagctgt acacaatgaa tgagctgagt ttccaggtta cagcacagta 22020 

gacactgtcc atcagtgaga gcacagccta gcctaacagt acacatgtct gctttcttca 22080 

cgtttccaga accaagcctt gctggataga gcatatttgt ctgtttggct tatttcactt 22140 

gataaaaagt tttcaaggag ggccaggtgt ggtggcacac gcctttagtc ccagcactcg 22200 

ggaggcagag gcaggcaaat ttctgagttc gatgccagcc tggtctacaa agtgagttcc 22260 

aggacagcca gggctataca gagaaaccct gtctcaaaaa accaaaaaaa accaaaacaa 22320 

aacaaacaaa caaacaaaca aaaagccaaa aatccaaccc cccccaaaaa aaaaaccaaa 22380 

ccaaaaacca aaaaacaaca acaacaaaaa gtttttgagg tttaatttat tgcatgtcac 22440 

agaatttcac tgtttaaaaa aatggctgaa taatatttca ctatccattc acgtatttgt 22500 

aggcattcat gtgtgtagtg gtttaaataa aaatagcccc cataggcttc tacagttgaa 22560 

tgcttagtca ttgagtagca gtactagaga gggaattgaa ggtgtggcct tattggagta 22620 

ggagtggcct tgttgcagga attgtgtcac tttgaggtcc cagcaacaag gttgctctga 22680 

tcacatccaa agacattcta ggtctatgtg atctggctgg aattcagaca tgcccttaat 22740 

acacaccttt aatcccaaac aatgaaggta aagttagttt ataaaaagaa gcacccatgt 22800 

ttgaaagtga cgtttaatta agagtgatga attagagaaa gatctgctgt cacagagcag 22860

agaggaaaga gaggcagcat aagagggagc atggcagagg gagagggagg aggggttttc 22920 

accagggcat ttgtacagag acaggttgca gagctagaac aggtgaagac agaacaagcc 22980 

agagaatgag aaggagccag gagattagga cagattgcca atgttaatag gctaagcaga 23040 

gcattttagt cagaaactga gagaagtcaa attgaatcag ttagcttgga aaggagtttg 23100

agcagcaaca gctgagttaa actagccaac agaatccaga aagaactaga aaagatgagc 23160 

ttactcagca gcaaatctca gaggctaaaa acatcttaga cctagattag actgcatgga 23220

ggctagacgc ttccagggct aggcctaggt tagcagacgg agagagtaat aagccttgga 23280 

gacaacagtt aatacagaag actatgtaca gacatggata tgaacctctc agccacttct 23340 

ccagcgtcat gcctgtctgc attgttagga gtcatctagg aaaggctaag ggcaggcaag 23400 

caacttttcc agagatggtc cactgttttt tgcatggctt ttgagaggcg agctctgaga 23460 

gggaaggttc caagagactt catcccagga ttgctgctta attacgacat gccttttctt 23520 

gtcactgtta tttagtataa tgactcctga gctttagccc atcctattgg gcatatttcc 23580 

tgcagatcaa cataaagatg aactttcaca aattaatgct gtttagatga ataaatgatt 23640 

ttataaaatt cctgatttga tttaaataat tttaggaaga aagctttagg agatagttta 23700 

gttggtttgc cagaaagatg taataacgtc agaatcaaga atagaatgtg gctgggcagt 23760 

ggtggcagat gcctttaatc ctagcacttc ggaggcagag ataggcggat ttctgagttc 23820 

gaggacagcc tggtctacag agtgagttcc aggacagcca gggctacaca gagaaaccct 23880 

gtcttgaaaa acaaaacaaa aagaaaagta agtaaaggct gcataataaa gaatacaatg 23940 

agctttcaca actacaccaa aaagagacat gcttgggaca aatttgtgat caaggaaaaa 24000 

tattcattct agatcaggtc caaggatgaa gccacaagtg tgtgatatga tgaacaagac 24060 

catggataaa ctgttgtttt gagcttaaag aataaaacac tgctttgaaa ttaactatca 24120 

acattctact gtaactttcc tttttataaa ttttatctat gagataattt tctaaagaac 24180 

ttgtgtctat aaaggtatag aaggacagag agaaagaaat aaggtgtggc atctgggctc 24240 

tgctccatcc acccaaataa atatgtgtgt gtgtgtatgt atgtatgtat gtttatctat 24300 

atgtatgtat atacatacat gtgtaggtag gtatatgtgt atgtatataa gtatgcatga 24360 

acacttggga agttgatgag acaagtgaga ggttgggccc ccnnnnnnnn nnnnnnnnnn 24420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 24480

nnnnnnnnnn nnnnnnnnnn nngaattcac tctgtaaacc atgctggcct tgaactcaga 24540 

gaaccgtgtg cctctgcctc taaagtgctg ggattaaagc atgtaccacc acaacccagc 24600 

tagtttaaat gtttcttatt tttttgttta tgggtctttt acctgtatgt atgtgtgtgc 24660 

accatgtgga tgcatggtgc ccttagagtc cagaagaggg tatcagatcc cctggaactg 24720 

gagtgacaga gggttgtgag ctgggacttg aacctaggac ttctaaaaga gcagcaggtg 24780 

ctcttaatag ctgagcctta tctccaggcc gtcccatgga tttggggggc tttgtttcat 24840 

tttattttgt tttgagacag ggtgtgtagc tcatgcttga atttactatg aagccctgac 24900 

tcccctcaaa gtaaagatcc tcctgcctct gtctacagct gctaggattc gaggtcttgt 24960 

accacatgct cagcacagcc atgattcata acaataaaaa aagaaagaga gacctaaatg 25020 

gccttagaga taaataaatt attttttttt taaagattta tttatttatt tattacatgt 25080 

aatgcacact gtagctgtct tcagaccccc cagaagaggg agtcagatct cattacagat 25140 

ggttgtgagc caccatgtgg ttgctgggat ttgaacttcg gaccttcgga agagcagtcg 25200

ggtgctctta cccactgagc catctcacca gccccgagat aaataaatta taatgtatgc 25260 

gtaaggtggg atcatctcag tctccgggaa tcttgcctgt tactccttcg ctctcccttc 25320 
tattcatgct tgggtaactg gccctggctg attgatgaga gctgatttcc ccactgccct 25380

gtggcaggga ccactgcgcc cacagggctc cctcaggatc ctcagtacag agctgcacag 25440 

ctgggtggaa gtagagggct gcatatataa cacgatctca actttatttc tttaaataaa 25500 

aattttattt aaattttata cagctctata taaacgaagg aactattgaa ggttcagcaa 25560 

ggacctgcca acggttgtca agggtaatgg cgatgtagtg attttttttc ccccttccat 25620 

tttacttcca tactttctac attaccccac aactggcaag tattatttta aaatgaaagt 25680 

aaatagtgac agatgacttt gaaggaaaat tgaatcggta aaaagaaagc tgagagacca 25740 

cccgggaagc ccaggctaaa tgtaatctgg gtcaggcctc ccaggcctgg ggtctcaaga 25800 

tggtcagctg agggaccctg gtgaccctct tgggccagca gggacgggga ggagccggaa 25860 

gctgagtacc caaagtgctc ctctgggctc caagggcctg cacagagact gtgtgggaat 25920 

caaaggatac aggcatgagg actgaggcct gacgaaccca gctatcattc gtcctagaac 25980 

aggaggcaga gctccaagag tccaaccaag aggcaggaag ctttgggacc cgagatgggc 26040 

gatgggatta gaaaggcatg tttgcaaata ctttcaaatt tacgatgcac actcactgga 26100 

aaccccaccc ctgggtgtcc cttccctgcc tcttgccaca cccaatagct gacatcactg 26160 

gagaaagtcc caagaccagg ctggctggag ctcctgatag gttccaccct cctgcagagg 26220 

gccctcgaag actagcttgc tcgcccacac cgccagatgt ctgtgtcttt ctctcttttg 26280 

cctcccaccc tcgtctcttc ctccaacctc agtggagggt cccctgcttc ctggggaaag 26340 

tagaacttgc cagtgctcac tgtaatgtcg tccctgtagg tgtcatggtc ccccattact 26400 

gggagcaggt atgcctcaga tctccctcta ttcgctgccc tttcaggctg tctcagtttc 26460 

tctctgacag ttcctctcct cctgaatcct gcttgttggc atgcgaacag gctcaatatc 26520 

ttccatctca aaaaacaaac actgggaagg tgttgagaga cagagagcat gggtaatggg 26580 

tgccccagct tggctgggaa ggggtaactt acaatgctct actgcccagt agggtagctg 26640 

cagttgtcaa ttaattgtaa atttcaaaat agctagtaga gaggatttta gatgttccca 26700 

atcccaacac aaagaaatga taaacattca aggcgatggg tatgctaatt gctctgatct 26760 

gatcaccgca cattgtatac atgtttttga aatgtcaggc tgtaccccat aaatatgtac 26820 

aattaccgtg cagtgattca agataaaaac tataatttta aaaagctaaa aacagaagga 26880 

aatagctgcc cttgaccccc ccacccccac aaggtccttc ctgtttgtcc agccacttaa 26940

tgtcagagct tcctgtggga gggtggtttt ggtgtacaca gacactcctt cctccctcct 27000 

tccccataag aggagtcacc cctgtcccac gatgccatgc agggccacat gcgtgatatt 27060

aaccagtaag atgtgagcag ggatgatacc tgtctcttat aacaaacgga aaaaaaacca 27120 

caccaaacca aaaacaaaca aacaaacaaa caaacaaaaa cagggttggt ctgtccctgt 27180 

gtcttttccc acataaagtt aagcacacaa agtagccacc atttatttat ttgtcccctc 27240 

ccccacccct ccccgagaca atgtttctct gtataacagc cctagctgtc ttggaactca 27300 

ttttgtagac caggctggcc tggaactcac agagacacag agattcacct gcctctgcct 27360 

cccaaatgca gggattaaaa gcatgagcca cgaactaacc agtaccccag agctcttgac 27420 

tctagctgca tacgtatcaa aagatgacct agttggccat cactggaaag agaggcccat 27480 

tggacacgca aactgtatat gcctcagtac aggggaacgc cagggccaaa aaaatgggaa 27540 

tgggtgggta gggaagtggg ggggagggta tggnnnnnnn nnnnnnnnnn nnnnnnnnnn 27600

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 27660 

nnnnnnnnnn nnnggcagga tcctgtgttc atgtgcaaca ctcgatgcaa gctgtgtagt 27720 

gtttggttct gagcacctga aggggaccaa gcaggctgat gcccaggcca cgggttcttt 27780 

ctggccccac tgcccactcc caccctctgg catccccatg atgaacatgg ccacagatca 27840 

cactactctg ctcctctccc agatqcacgg agccataggg tccccagatt catctctgca 27900 

gctaacaagc tgggcagtgt cacctccctc aaggttcctt tcctgctctg agcagcagtg 27960 

tctcccacag tgagacactc atgtccactg gaagatattg tagccattaa attcctgtgc 28020 

taaaataact agggggactt gtcaatcact acactcttag ccccggactt ctgactcata 28080 

gagggtggtg acagctcagg gacctgcatt ctaccaaata gccatgtgtc cctgatggag 28140 

gaactgcccc tggacaacct ctgcagcaac tgaaccctct gtggtctcct agttcttctg 28200 

gacaggtgtg accccagtac ctagtgccag gtgagagagt gctagggcca cactaagggg 28260 

tgacaggaca aggttggagc tggtagatgt ttgggccacc aaagagaaca ggtcagtagt 28320 

aaaagccatc atggcctgag ccagcctgcg agtctcctct gcagttggga cactcttgca 28380 

gtgtcctggg gacctcttga gggtagcatg gtcaccaaaa tcctacaagg acagatcaga 28440 

agtcagtgag gtcaagggaa cagctctagg ttctctgtgt ccctcacgga cctttttttt 28500 

tttttttttt tttttaagat ttatttattt attatatgta agtacactgt agctgtcttc 28560 

agacagctcc agaagagggc atcagatttc gttacggatg gttgtgagcc accatgtggt 28620 

tgctgggatt tgaactcagg accttcggaa gagcagtcgg tgctcttaac cactgagcca 28680 

tctctccagc cccctctcag tcctgatgcg acagggcagc aaaggccttg tcccagatct 28740 

gaggagagtc atgctgaagt ccttcctacc ccaccccttc cgaacccctg aacatcagcc 28800 

ccataactac tgactccccc acccccattc ccttgcttcc actgatccgg tcctcctctt 28860 

ccctctggcc ccacccattc ttccccagcc ccacctgatt gtacctggtt gtccaacttg 28920 

aagagggcag gcaggggcag cttctgctgg gcctgctcac tcactggctg tagaaatgag 28980 

aaaggagatg aagaaaaggc ccttcccatg ggtccccatc ttgccaagac ataggtgagt 29040 

ccctttggct cttcccccta aacctctcac ttttgagtac ctgctggccc gggagatcca 29100 

cggcgctcac cggagagaac tgttgagaaa agggagaaca gagaactcag cgttcctccc 29160 
tctccaccct tctggcctct cccagatttg cccccgcccc ccagcatctc cttcagcctg 29220

actgaccact tcccactcag acctcagctc tgcctcaccg tgaaacaggg accttgcagg 29280

caggacaagc tgagtacgag gagcccccgg agcagtgcca tgttcctgta tccagaacag 29340

ggagtgttag ttcctacctc acgctcgaag gccaagcagt agactgctat ccatgggttc 29400

cttgaccgca ccaggctgcg gaacctggac tcaaaacata gcagctgtgg acctcactca 29460

ctctgagagg tgggatttcc ataagctttt tttttcacct gtacatttag tcttcattct 29520

tttcgtctta cactgtggat cagtcctggg ttcaaattta aagccctcat cttgcaagag 29580

gaccttgcgc atctcccttc atgcctttgg ctttaccctg tcttggtaat tcatggcaga 29640

agttcttcct gctcccatgt agatgttgag gacccaaata agaatctctg taaatactga 29700

gcatgatgcc tggcccccac cctagcaaag ccacctgacc tgttgttcat ttcatccagc 29760

ctttctcagg ctgccctggt cctacccaaa ggctctgaga gctaatctgg gctggcaggg 29820

cagccagaaa cttctttgtt gaccaatgaa tgactggccc agacaccttt ggacttacgg 29880

gaactacaag cctcatccca cttctgctcc aagttctgat ccagggtgct tcggggaagc 29940

ccagctggcg gaagggggga ggctctcagc ctagagagcc ttcctttcca tcctcagccc 30000

cctacccagg ccttatttca ggcaccagct cttctaaaag gtccttctgt tatccctaga 30060

cctccacaac tgtgttcaag aaccttcagc cagggcctca tctccaatct ggatatatga 30120

tttttctcgc caagagtagg cctccaggtt ttggagttct agaggtttct cctggagctg 30180

cctggacctc tgctcctcac caccccagga cgctgtgaag ctgcaggctc cctgaataaa 30240

ttcatccaga ccccttgcca aggtgccagc tgtctacttc ctctgctgcc caagcagcag 30300

gctgcaccac ccctccatcc tacctcttca ggcttcttag cgcagcacac gcagcacacg 30360

gtgttctcct ggaccagctt gctccccacg ctcccccagt gcagccagca gggcctagct 30420

ctctcctccc acaggacctt tgctctcagc caacccccgt tcagcttgtg ttcagtgctg 30480

gtaaatattg acctgtacat ccggttaaac attgatatgg gggccagaag accctttccc 30540

atcaaggcta cccagaaccc tgcctgagcc tggagaaggg gtttacagga gcagataagt 30600

gaggaggttg ggcctggcaa gccttctaat gatccctcaa cataggggat tatccacagt 30660

cagtgaggct cagagaggct gtgtggcctg tgtaagggcg cagagtgggc tccagagtca 30720

cagccaaagt cccaccacca ccaccaccac caccaccacc accaccacca ctaccaccac 30780

caccaccacc accaccacca ccaccaccac caccaccact accaccacca ccaccaccac 30840

caccaccacc accaccacca ccaccaccac cacctcatct acccatacta anttgaggct 30900

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 30960

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tctatgaggg ttacatttta 31020

gaacatctct ccttttttcc tttttgagac aatcttacta tatttaggct ccccttgaac 31080

gtgtgatcct tctgcctctg cctcccaagt gctgaggtta caggcatgca cagtcacatc 31140

tgcctacaca atgtcttagc agcccttagc agcaccaggg gtcaggaagc cctcaactgt 31200

ccctttagct ggcttctctt gtgaagggct atgtcttctt ccccttccta gcatggagac 31260

ggctcttagc cccagagcct tccttccttc aggttaaaca gcaccagttt ttggtgggac 31320

ctcccatttc cctctatctc cctaagcaac gaccttttct gctctgactc tcatctggca 31380

cttggaccac aagacaaaac tgcagcctgg gctgtgtgtc ctcgcacatc attcctgtgc 31440

ccccctggag tcaggtctag gggaggaaga cagggttcac gactcagaaa agaccactgg 31500

ctgtcctagt gtgccctcac ccatcctata gcacgcacat gctgatgtgc cccctccgct 31560

ccatcaccat cctctcatgt acacgtgccc tccctcgcca gacacatgca tcactaactt 31620

ttctgacttc ccagaaaaat atctgatctg agaagttagg agtctgccat catcagctat 31680

ggtccttaaa attaagtcag acaatccatg ggacatgaag ggcaacaacg agaagactcc 31740

tcgttccttg ttcactctgc ttttggcagc accaccagca ggaaccaacc tggctctccc 31800

taatccctca tctatagcag gtctcccggt gggaatttta gggacctctg tgttctcatc 31860

caggggcact gccactcagc tgctcaggga gagacccctt agaacaacaa agaaatcaat 31920

gcagatttag gcttcttgtt tcccttccca gcccctccca tcacaggcaa cagcctccct 31980

tggctgagcc tcaggaggct gatttatcag agaggtgctc agagaggcac ctctggtccc 32040

tctgggtagg tagcaactga gacaggagga gatggtcacc ctgggcatcc tctaccagga 32100

agtaaatgag atacccttgc agatgggacc cctgaagttc ctcccggggg cgggggtggt 32160

ggtggtggga gtctaagtca cagatctttg ttaccacgtg gttagactga ggactgaatc 32220

tgaggtggga aatctgatgt gcatggggaa acacagaggt ccaatgctgg ccaagagcta 32280

caagcaggga caggtgctag ggggatgtct gaatgttcca ccccaagcca caggaataac 32340

ggaaatggag actctaaagg gcagaaagtg agggtgtgca gcaggggctg cacaggacac 32400

atgcaaggcc ctggctgcaa taactgggtt ggggaggcag tcattggcta gccaggggca 32460

ccaggacagt gatgccatcc tgtccaaagg gcagtgtcca agccagattt ctaggctcca 32520

gggggaggag ggtccgggga gaggggtcaa gattctcccc ctctgagtca aggttggcct 32580

tcccatgtgc cccaaatcag gaggcacaga aactgggatg ttgtggtctc acatccaagc 32640

tgagaagaca agtgggagcc agtacatgtg tttcagatta aacccagtcg gagacaaaca 32700

tgttgctcct cctcctccca gagccaagct gccttcaagc cacatggcag tgaatatgcg 32760

gacagtgcag gggaggacac ctctctctcc actggctcaa ggacagtttc aaggggttca 32820

ggctggctgg ctcatggcta cgccgctcac cccctggaca gtttggggtt tttccctcct 32880

gaaatcttgg aatctgaatc agcctgagat accccataat tgtacctccc aacaccccca 32940

gaaaggtcag ccctgcagaa cagaactctt tggtccccac ccatccccct cagccctgga 33000






ggctgaactg atgggcagct aaggtccaga cagtggctgg ctcttggaaa gcctgtctct 33060 

ttcctttgac tcagaccact ccctgccgtg gcttacatca ggaggtgcaa gggctgcagg 33120 

agggcagcca gaccccacaa accagctagg ctaaatggtg cttattgttc gcaagaggcc 33180 

atgacctcat ttgtctccca gctcttttgg taagagagaa tgagaggaag ctggacagag 33240 

aacctagcag gcctcaggca gcccactgct ccttgctgta agggaaccag caccgatggt 33300 

tctgaaaagc agcgatccga atggagtcag gctgagctgc aggaagctca ccttccttgc 33360 

tcactgctgg tggaagcaac ttcaggaaga gcccagccta tgggactata gctcctccgg 33420 

ggtactgctg agtccagccc cagagcttag ctccctgctt cccaccaccc accaccacat 33480 

cctttcccaa caccattcaa aaccccagtc cagcctctcc tactggtcta cagtgagcgg 33540 

ctaatagagt cctgggcctc tgtcccccca attctctctc ccctctcatc tgttcacctt 33600 

ggttcctaaa ctgcaggggc tactataacc ctacctccac ttccttgcac ccctcttttc 33660 

tgctctctgg ggtgcccctg ccactcccag tccctctagc cagggagcct cttccatatc 33720 

tgtcttcccc aggctagacc aggcgctgcc ttacctgtgg ttgcggcagc ttctctcaca 33780 

gcctgcactc tgaggggctc caggaagcag tgaggggagt agctgcctct caaccagcgt 33840 

ccagcaggct tcagattaca gctactcttt tcttaaagtg acctgactcc atttggaatc 33900 

tgtgattgca tcattgtctg gtgttaactt taacccactg ctgcccttcc gccatgtggc 33960 

tccaagacca cacgttggcc accctcctct cccaccacat ctcccttgga tctttatctc 34020 

tcttcattgg gaccttcatt gggacatgat ggctaacttc aggggcactt gggccagcct 34080 

ggggtaggtc atgagtctga acttgaacat ctgaaaggat tggctgagag gcaggctgca 34140 

tggagagact gtgagccagc cggtatggag atgctgggtt cttccaggcg cttggctctg 34200 

gctcactgca ggtgggagca aggtgattct tctcccctcc tcacctggaa aatgaaggaa 34260 

tgggactgta cctgacagct ctgaaggttc caaaggacag tggggtgggg actagagagt 34320 

tggcccagtg cttatgagca ctggctgctc tcgcagagga cctgagttct gttcccagct 34380 

cacatagcaa ggactcgaaa ctgcttggaa ctccagctcc agagaatctg acgctatctg 34440 

ctnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 34500

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtcagg catggtggtg 34560 

catgccttta atcctagaac tcaggaggca aagcagatgg acatctgaat ttgaggcccg 34620

cattgtctac atagcaagtt ctaggctagc caggtacatc ataagaacct ttctcaaaaa 34680 

ataaataggg ccagttggca aaatttagct tgccctccta acacaagaac ccaaagtcaa 34740 

ccccagcaac catgtaaaaa gaagcaaggt gtggtggcac ttgcttgtga tccagcattg 34800 

tcaaggtgga gacagacgga tccatggggc tcactggcca gccagctagc tggtctactt 34860 

agtatgctcc cagccagtga gagactgaaa aataaataaa taaataaggg gttaggaaga 34920 

ggtaacatgg tggctcagtg agaaaagata cttgtcatac aagcctagca accctcaatt 34980 

ttcagtggcc actaaaggtg gaaggagaga accaactcca aagaattgtc tcctgacagt 35040 

tttatgctgt ggtacacaca cacgcacaca cacacttgtt acatgcatat gcatacaatt 35100 

aataatttaa aatgttatgt gtatgggtgt tttgcctaca tgcatatctt tctgtgcacc 35160 

acatgtgtgc aatgcctgtg aaggctagaa gaggacatca gatcccctcg gagttacaca 35220

gggttgttag ctaccatgtg gattctggga acaaaaccat gggttttcca aaagaggctc 35280 

ttaatcactg agccatctct ccatcccctc aatagtatat atttctgggg ctggagagat 35340 

ggctcagagg ttaagagtat tgactgctct tccagaggtt ctgagttcaa ttcccagcaa 35400 

ccacatggtg gctcacaacc atctgtaatg gaatctgatg ccctcttctg ctgtgtctga 35460 

agacagctgc agtgtactca tatacataaa gtaaataaat aaaccttttt tttttttgtt 35520 

ttgttttttt tgtttttcga gacagggaga cagggtttct ctgtatagcc ctggctgtcc 35580 

tgaaactcgc tctgtagacc aggctggcct tgaactcaga aatccgcctg cctctgcctc 35640 

ccaagtgctg ggattcaggg tttgcgccac cgccaccacc tccaaggctg ctgctgcggc 35700 

caccaccacc ccaccaccac actacctgac tatttaactt ttaaaggcag ccatctcatg 35760 

gaaaatgaca cctagcattg tcctctggtc cctacatgac cccatgtgca aacacatacc 35820 

tgcataaaca cacataaata cataagtaaa cttagtctgg ttgttttgga aatgtgctat 35880 

ggtttggatt gtgtcccccc aagggaaaaa ttgggtccca gagtagtact attggaagag 35940 

agtagaactg ttagggttta ggcctggtgg gaagtggcca tggctagaga gacatgccaa 36000 

ccaaggggaa tctctggctt catcttttcc ctttgctttc aggtcctaag acagtcacaa 36060 

ggctgcttca ccacatgccc agaagcaagg ggagcagtca tggctgggac cgctaatgca 36120 

gctgttgatt tccccagata tttgtagtag taagagacag gtgaggaacc ccacagcaag 36180 

tgttagtaat tgtgtgtgga ggtgccctcc ggggacgggg gccctcctgg ggcaggacgt 36240 

tcctcttcct catccacctg cactccgaga acaggaaatg gtgactttgg cagagcttaa 36300 

gcagagcccg ttcatgttac aagtatgtaa attcataagg accagtttct ctccatatga 36360 

aacagcttca aacaggagaa ggaagaagca aacattaagg aaaagctctt ttattgcaga 36420 

ggctacactg aagctaccgg ccgccttcct ggaatgtata atcagcttcc ctctgggggt 36480 

tctgtagagc actgagacat taagtactac tggggtccag gattctgcct atgaagagga 36540 

gggcccccgt gtccgtgtcc ctcagaacaa agaggaaagg ttggttaagg tgatagtcta 36600 

gcgggaaggt gaggcggacg ggctggaggc ctgggctggg gctgcttcct gccccctctt 36660 

cattccactc gaaagcagcc ctgtgttcca cttgggtgag cttcacgggt ttgccagtaa 36720 

tcttgctgaa gtcgggtgat tcaaacaacg actgtagctc tgtggagatt cagagattcc 36780 

attaacacca cacacacaca cacacacaca cacacacaca cactccctgt ttgtgtaggc 36840 
tgattttcaa gaaagcaagc tagaagtgga gtacctcaca gtgacttgtg agctatgagg 36900

cactctgtga caggctcagt gacctacctg agaacttata gccaagatgg ctgaagccag 36960 

acctggcctg agagaatgtt ttgggctgtt ataggacaca tagagataca cacacacaca 37020 

acacacacac acaccaagga ctgagtctaa tgggaggtgg ttcttcattc ccctcccctg 37080 

taatggtgtc acatgttccc tgagccaccc tacaaagaaa gccacaggac tcagttctgt 37140 

cagcaaggtg gcaggctcca agactcagcc ccgagcgcaa agtggccttg caaacatact 37200 

catgtcctgc agagacttgg taagttcgcc ttcgaagctc agcttcagct tggggacagt 37260 

cagcacagct tggatagtct tcagttctcg gtcgatgtca tgaatgaact cagaggtgag 37320 

gctctcttct atcatggtca agttctgggt cacggtcagg ggcaggaaga agatgatgct 37380 

catacttcct gtcaagggca gctgggcaat ctaacccaac agagatgcgc acaggttagt 37440 

tgtgagccag aaaaaacaaa acaaacaaac aaaaaaacac caacagctgc cttcccctct 37500 

gctgtaacgg ggccccagcc ttgtgctccc cagcctcagc ctgggctgta ggctactggt 37560 

tactggcagt ccttccatga gtagggagtt ttcttctcag cctaaaaccc acagaagttt 37620 

aatgaacaca cgtttgtttg tggttccgct acggtttcta ttgtgataaa acatgactga 37680 

aagcaacttg gagaggaaag ggtttatttc atctgacaat tcgcagggtg tcttctcatc 37740 

actaagggga ctcagggcag gaactgaagc ggaagccgtg gaggaacgct gctttctggc 37800 

ttgctccccg tggcttctta gcctgctttt ttatgctatc cagaaccact tgcccaggag 37860 

tgacactgcc cattgtgggc tgggcccccc cacatcaatc actaatctag aaaatgaccc 37920 

acgggtttgc ccagaggcca gtctggtggg ggcattttat caattgagtt tcacccttcc 37980 

aaatgactct aacttgtgtc aagttgacca cacgaatcag ggcctggttc ttaggagctg 38040 

aagtggaatg tcccccagag actgcctgcc agcactgctg accatttgct ttgtatagag 38100 

cattgaacca gaaatgaaca ataaaatgga tcctttgaac agatgtgttg atcctagggc 38160 

ctgtggacac agcgactggg cttcccagag cccccatgga atcannnnnn nnnnnnnnnn 38220

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 38280

nnnnnnnnnn nnnnnnnnnn nnnntggcct catcaggtat cagagagaga gagagagaga 38340 

gagagagaga gagagagaga ggataaaagg ttagcccagt ggtggtggca cataccttta 38400 

attccaacac ttgagaggca gaggcagggg gagctctgtg ggccagtttg gtttacagag 38460

taagtttcag aatagccagg gctacacaga gaaaccctgt cttgaagaga aacacacaca 38520 

cacacacaca cacacacaca cacacacaaa taagatcttt aagaagaaaa gaaaggatag 38580 

tggggaaaca tctgagcaga ggaagaaatg gggtgcgcag gacacccacc ctcagaggag 38640 

gccctcactg gaggtgtctg cacaggagaa cacttgcact cagcttgccc tagggcgtca 38700 

gaactcagaa ttcagtttca aagcactgac aggagcagtg actggggacc ccaggttgaa 38760 

tccccccttt atctaaaatg agtaagaacc aaaaaaacaa aagtgtttgg gatttggaat 38820 

ctgggttatt tgcatctaca aaagaggtct tggggaggaa acccagttct acccctggaa 38880 

ttcatcagtt tcctataccg ctgactacac aggggctgaa ggtaatctca atgttttcat 38940 

aactggtgtg gtgctacttg ctcatgatcc caacacttgg aaggtaggtc agaagttcaa 39000 

gagcagtctt gactactcag tgaatttgag gctagcctgg gctacatgaa aactcataaa 39060 

acaataaaag aaaagaaaag gtggcgagtg aggtggccca tcaggtaaaa gtgccaacca 39120 

cctcgcctga aagcctccac acggaagggg aaagccagct tctacatgtg gtcctctgcc 39180 

ctccgcatgc accatggctc gtgcaccccc acacccaccc acccacccac ccacatgaca 39240 

taaatacttg taatgattag tttctgaaga acaatatttt cgttgatctt gtttagggaa 39300 

caaagttcgt gcacattgac ctgtcgaccg tgtagtacgg gatccgctcc aggaagctaa 39360 

agattttggg atgcttcatg ttgcagctac tctgatgaac agtgttgctt tctgggccag 39420 

gtaatggtgg catatacctt tgatcccagc acttgggagg cagaagcatg tagatctctg 39480 

tgagttcgag atcagcctgg tctacagagt gagttccagg atatccaagg ctatacagaa 39540 

aaacccctgt ctctaaaaat cactaattta aaaaaaaatt cctttctaaa cctatataac 39600 

aaatgttttg taggctgcct taacaaagcc caatggccat tcagagaagg ctcaaaagag 39660 

aaagtttagg ggactctaca agcatcctca ggaaggccac agaaagcaga gcctgggcca 39720 

gtgagacttt gcagtgggca aggttcagct ctttatgtag gaagaagaga gtcaacagtc 39780 

agagtccagc tttccataaa acctgtgcag ggcctctagg caaagccctg tgttaggggc 39840 

aaaggcattt gcagtctaag cccggtgaca tgagctcaat ccttggaacc caggtggaag 39900 

gagtgtgctg actccacaaa tttgtcctct gatctataca tgtatgcacg tgcacgcaca 39960 

ctcacataca tgtccacatg cacacatgca tatacatgcg catgcgtgca cccacacaca 40020 

gggtcaaaag cagcaagaga tgccctgtga aaaacgtctc attcagtctc ccatcatcca 40080 

gtgccacact ctgagcacag gtggtactga tatcgttcct gattgatcga tcagttgatt 40140 

tgagaccccg cctcactatg tagcccaggc tggcctggaa ctcacagtga tcctcttgct 40200 

tctgcaagat gagcccatca tgcccaccat gttattgaag caataccatg ctctataaag 40260 

caaacctagg caggcaggat ggtggactcc tgtaatctca ggacttgaaa agtagaaggg 40320 

agatgaggag ttcacatcaa tctcccgtat gcgttggagg ctggagtggc tgttccctgg 40380 

gcgcttctgc cagcacctga ccaatgcaga tgcagatgct cacagccaac catcagactg 40440 

agctcgggac cccagtgagg gtgctggggg gaggactgga ggagctgaga cgggattgca 40500 

agcccatagg aagaacaatg tcagctggcc aaaccaccca gagctcccag ggactagacc 40560 

acgaaccgag gactgcacat gaagggatcc atggctccag atgcatatgc agcagaggac 40620 

agccttgtct gacagcatgg gaggggaggc cattggtcct gtggaggttt gatgccccag 40680 
tgttggagga tgctggagcg gtggggcagg agtgggtggg taggtgggga gcaccttcat 40740

agaggcaaaa gggatggggg agaaggcaga tgggatgggg gggttgtgga ggggtaagaa 40800 

agaaaaaaga tgtctctgaa agtaaaaagt acttgtcact aagcatgagg atatgagtca 40860 

acccccaagc cccacagggt ggaaggagag aaatgagtcc cacaagttat tttctgatct 40920 

atacgtgcaa tccatggcat acgcagaaac gcaaagacag acaatgagtt gggtgtggtg 40980 

gtgcacatgt aattccatca ttcaggagac agaagcagca gagttgttgg aaatctaagg 41040 

ccaacctaaa gacctacacc caaagaagga caaactataa ggaaaaaggt ggtcgaccaa 41100 

tgtaacatta aagttagaaa tctctcttca cactgtgtag atactgtaca aggaagagaa 41160 

aaggcagcca catcaaaaca gtgtaaatca acgagaaaac cagaaacaac tcaagagaag 41220 

gctgcagggg cctgaattct gttctcagaa cctgcatcaa gccaagagaa tcaaaactgt 41280 

ctgtaactcc agctccctgg gatccaacac ccatttctgg cctccatcag catcactcac 41340 

aggtgtgcac acatacacat caataaaaat caaaaccagg gatgaagggg tagggaggtg 41400 

catgtggatc tgggaggagc tgagaggcac tgggtgaata caataaaaaa tttggtgcat 41460 

ggtggtgcac gcctttaatc ccagcacttg ggaggcagag gcaggcgaat ttctgagttc 41520 

aagaccagcc tggtctacag agttagttcc aggacagcca ggtctacaca gagaaaccct 41580 

gtctcaaaaa aacaaaacag ccgggcggtg gtggcacacg cctttaatcc cagcacttgg 41640 

gaggcagagg caggtggatt tctgagttcg aggccagcct ggtctacaaa gtgagttcca 41700 

ggacagccag ggctacacag agaaaccctg tcttgaaata aataagcatt tgttgctgtt 41760 

acagaaaact ccagcccagt ttccagcaca cacagggtga ctcacaacat cataactcca 41820 

cttccagggg atccaatgcc ttcttctgac ctctgtgggc accaggattg catacagtgc 41880 

acagacatgc acataggcaa aacactcaca aaataaaata aatctagcaa aaaaaatttt 41940 

aactaataat ttaaagaaaa aaataaggaa gccgggggtg gtgtcgcacg cctttaatcc 42000 

tagcacttgg gagacagagg caggcggatt tctgagttcg aggccagcct ggtctacaaa 42060 

agtgagttcc aggacagcca gggctacaca gagaaaccct gtcttgaaat aaataaataa 42120 

ataaaaaata aggccaagta attcttggaa gaatcccaag gggacactaa gtgtatataa 42180 

aggcgttcca tagggctagg aatgaggctt agcgagagca acttcgctgg tgtatgaaag 42240 

tccctcagct gcatgtggta cctttaatct aggctctccc gaagcagagg cagaaggatt 42300 

tctgtgagtt caaggccagc ctggtgtaca tagctagttc caggacagaa agggcgatat 42360 

aatagaaaca tcntacctag agcccngcca aanaaagggg agacctgaga ccagagagat 42420 

gactcagtgg ctaagagcat tgactgctct tccagaagtc ntgagttcaa ttcccagcta 42480 

aaaatttatt taaatgttta ttacttgtat tattatttaa atttaaataa ataagtaaat 42540 

gggagcctag gtttgagtcc ccaaatcacc aagaaaaaat gttatcattg ctaataatca 42600 

aattaagagc ataagaactt ctttttaaag aattcttatt tattttatgt atgtaagaac 42660 

actgtagctg tcttcagaca caccagaaga gggcattgga tcccattaca gatggttgtg 42720 

agccaccatg tagttgctgg gaattgacct caggacctct agaagagcag tctgtgctct 42780 

taagtactga gccatctcta cagctcttat caggttgata aaatttaatc tcgtggagcg 42840 

ctgagaccaa gaactaaagc tgggagattg aaaaatgcag accaccaagg ccctgctcat 42900 

ttctccagtt ctgatcagct cccgtaccag gggtctaacc aggcctgtgt ctgcttccct 42960 

gagtagacca gaggccccat ctaaacagcc tgcctgcagc agctcctctc tctaggtgga 43020 

cagatgggaa tttcagacca atgtcatttc ccaggacatc aacacagcag ccaaatttat 43080 

tggtgctgtg gctgccacag ttggtgtggc aggatcaggg gctggcattg gcacagtgct 43140 

tgattattgg ctatgccagg aaccagtctc tcaagcagca gctcttctcc tatgccatgc 43200 

tggggtttgc cctgtctgag gccatgggac tcttctgttt gatggtcgcc ttcctcatcc 43260 

tcttcgccat gtgaggctcc ctggggtcac ccagccgtcc ctgctgcctt gactccatgc 43320 

cagtcctggt gctggagtct actgagattt accattaaac agcaacgttt ctctaaaata 43380 

ctattaatta attaattaat cacgtgacaa ccccagcgtc catatgggtg tggaaaatga 43440 

ggaactctac ccatcataca tggcgactat gaagaacaat gtgacagaaa atgctaacat 43500 

catgtgtgac cgcatgcatc agccctgact gctaaaagtg gacaagcccg aagcgaaagc 43560 

ccaatgttct acttctaaat gcatgcacca aacgcctccc acaggaccag aggtgcagct 43620 

ctgatagggt ccttgcctgg catgcatgaa gctgtggaca cgaggcatta ttcgcaagaa 43680 

cattctagct gtctggaggg ccctcaatcc actgtgttcg cgctgttcca gcaccagtgc 43740 

ctcctggggc tgcacctgaa aaaggggact gcttaagagg gctcctacca agcctactgc 43800 

cacagatgca tgatgggaaa gccttctgga agcaactggc tgccaaaggc tctggacaag 43860 

agatcaccct ctactggaaa ggtggtttca gtctaggttc tgtgggattc caggaaatta 43920 

gacaacactg gcagtccaac agacagacga tctaaacttc caaggcacag ctggtagaac 43980 

ttgctgcgga accagacaac aaggtacgag ctactcccat acaacataca aaaaagcaga 44040 

gagagagtca gagacagaga cacacagaga gagacagaga gagagtctaa agagagtcag 44100 

ggtctcagga ctgagggtat agtctactgt agagcatttc cccagcatat acaagaccct 44160 

ggattcaatc tgaacacagg aaaaaagggg gggggggact tcgattatct caaattctcc 44220 

tttttgtgac acacccctaa agtcactgcc tacttccctc accgccatga agtaaagagc 44280 

tgtttgcgct tatgtctaca cagtctcggc tcccacttcc tcctcccctc tgcttctgtg 44340 

ctcatctcct ctgaaaccac tgcagcaagt gacttgtgtt gactgccaca cggaaactct 44400 

cctcagtagc aggcagcaga gcagagctct gtcttctcgg agcttcttct ctcttgtcgc 44460 

cattttctcc cacccttaag taccctatct tctctgtctc tgcttgttga tccttggacc 44520 
cttttccttt ctatgaacaa aatatctcct taaaggatct cttctagttc agggtccccc 44580 

cgcccccact gtggagaaaa cccagggcct tgcacatgct cagcaggagc tccatccagt 44640 

ctctagctcc atgacttaaa gcatctctgt gctgtcaaat atacacttcc agcccttacc 44700 

aaaatattca gtcaactcct tgccattcaa aatggatgac ctcaaagcca gagtcagcgg 44760 

tgctatgact cccagatcca tccacttggt agcccaggaa tgaactcann nnnnnnnnnn 44820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 44880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnngg ttactgggat ttgaactcag gacctttgga 44940 

agagcagcca gtgttcttaa ctgctgagcc atctctccag ccaccaccac caccaccacc 45000 

accaccacca ccaccaccac caccccacca ccaccccgcc ccacctgctc attcctgatt 45060 

ggttggttag tttagtctgt gagacaggag ctgtcccttt tctatagtgg aaggtgaata 45120 

agaaactcct gaaagtgaag gcctacaaaa cagccacact tatttgttgg aaaatactgt 45180 

aaatgtgaca tgtaaataca tgctaaaata attcgttaag tcagtgaaca accttaaaac 45240 

acagtctgta gcctgaatta cagacacgac acgagccatg acagaggctg aaataagacg 45300 

cctttgcaag gagaagggca gaagcttcca tccttgctag caaatctttg ttccaagctt 45360 

tatcagattt tattgctttc tctttctgtt tttctttatc atatttgttt atttgttggg 45420 

ggaaagccta atcttcatag cccatgtatg tgagcacttc agcatatgtg tgtgaacacc 45480 

aacagcacac gtgtatgagc accacagcat aggtgtgtga gtaccacagc acatgtgtgt 45540 

gagtacgata gcacgtgtat agagttcaga ggagaactga gagagtccgt cttttcctcc 45600 

tactgtgtag gtctcagggg tggaacttgg gctcagcctt ggtggcaagc tcctttatcc 45660 

acagagtcat cctgccagcc cagctttctc tttttctctc tgttatgtct atccactctg 45720 

ttcaaggcta actcactgac tctgagttat cagaactgct tgtgagagca ggagtaactt 45780 

tggacatctg tgctggtagg aacaccatcc ccactcggct tggatgacga aggggaaaaa 45840 

aagcatcacc aaggagttcc accacctcaa ccagcaaata tttacctcct atacatggat 45900 

aggtggggtg ggtgagcctt gtgatttatc gttaggatct catgggagtg attacagctg 45960 

gtctactcca tgaccaaaat ggtgacggtg gctgaccaaa aagaaacagc tacacctggc 46020 

tctagttttc tttctttctt ttttcttttt cttttacccc acggtactaa ggattgaacc 46080 

caggaatgca agagctctgc caagtgagct acattcccag atctgttttt ccatttcttt 46140 

ctttcctttt agattttatt tcgatttatt tgtctatgtt tgtgtgtatg tgtgtatgct 46200 

tatgtgtacg tatcagtatg ccatgggtat acagaaacct gagaaggcca gaagagagtg 46260 

tctgggttac aggagtttcg agctgtcctg tgggttctca aatgcagcag caccaccacc 46320 

aatcaccccc acccccaccc agccttcgag ttcaattctc atcatcacaa aaacacacac 46380 

acagacaagg gcctgcaaga tggctcagca ggtaacgaag ctcgtgtcat aagcctgaga 46440 

acctgagttc actgtctgga acccgcgtaa agggggaagg gaagaatcaa ctctatgatg 46500 

ttgtcctctg ctctccccat gtgtgccatg gaatgaacag ccctcccaaa cacacatcaa 46560 

gaataaataa aactaaaatt agcttagtaa cttttatgtt gaaagtggtt tttacatgcg 46620 

tgggcaacaa taacaccgag agtagaaagg caagcatgta tgtcactgaa cagcattgaa 46680 

gaaaaaacaa acacatttcc tgtacatcgt tctgggagtc tgagttaggg tttctatgct 46740 

gggataaaaa caccctgaca aaaattaacc tggggaggaa gctgtttatt tcagctttta 46800 

tgtctacaac atgacctgtc acccagggaa gtcagggcag aaattcaaac aggtcagaag 46860 

cctggaggta gaagctgata cagaagccat ggagctgctt ctggcttgct ctagcacaca 46920 

ggagtactag cctaggggct gtactgccca cagtgggcta ggctctccca cagcaatcat 46980 

aaatttagaa aatgcactac aggtttgcac acaggccaat ctggtagggc cattttctca 47040 

attgaggttc cttcttccaa aaggacttta gcttgcatta tgttgacata aaaactagcc 47100 

agcatattgg gattatagat attctcataa aaaaaagaca tttagattcc cacataacac 47160 

catattcaga aattaactca atgtgaacca gaagctctga aagtaagagt taaaactatg 47220 

aaaaattctt acaaccatcc ataacaaaaa tctgatgccc tcttctggag tgtctgaaga 47280 

cagctacagt gtacacacat ataaataaat aaataaatat ttaaaaaaat atatgaaaaa 47340 

tcaggctggt gagatggctc agtgggtaag agcacccgac tgctctttcg aaagtccaga 47400 

gttcaaatcc cagcaaccac atggtggctc agaaccatcc gtaataagat ctgactccct 47460 

cttctggagt gtctgaagac agctacagtg tacttacata taataaataa ataaatctta 47520 

aaaaaaaaaa aaactatgaa gaactatgaa ctacaagaag tcaggaatag ggctgggggt 47580 

gtaacccaac agaaaaacac ttgcctggcc tgcgtttggt ctctagcacc accaacgtag 47640 

aaagagaaca gcagaggatg agggcatcct gacttgagtc aagtgacaag tgataatcct 47700 

cgagacacca aaatcacaat gataaaagag atcaacaagt tgggctttat ctgaataaag 47760 

agctgtgtcg ttaaatacca cgcaggaagt gaagaggagc tgagtctggt aacacaggcc 47820 

tgaaatccaa gctactgggt ggactgaggg aggacaacag ctagctcaag gcccacctgg 47880 

acgccagagt taactcagag agcagcttgg gtagctttaa tgagactctg cccaggccag 47940 

tgcacaggag agatggctca gtggttaaga gcattactcc tcttgcaaag gacctgagtt 48000 

caattcccag cacccacgtg ggcacttaca atcatccata actttagttt caggggatcc 48060 

aatgcccttt tcacagtacc aggcatgtac acagtgcaat tacatacata catgcatgca 48120 

tgcatacata cacaggcaaa acttacataa aatactaagc agataaatct taaaagaagc 48180 

cgggcgtggt ggcgcatgct tttaatccca gcacttggga ggcagaggca ggtgtatttc 48240 

tgagttcgag gtcagcatgg tctacagagt gagttccagg acagccagga ctacacagag 48300 

aaaccctgtc ttgaagaaaa taaaaaaaaa aaagaaaaaa atcttaaaag aaaaggagag 48360 
gactggagag atggctccac agttaagaac acttgttctg aggtctacag agtgagttcc 48420 

aggacagcca ggactataca gagaaaccct gtttcgaaaa accaaaacca aaacaacaac 48480 

aacaacaaca acaaaaccac ttgttcttac agaggacttt ggtttgattc tcagaatcca 48540 

catgatggtt cacaaccatc agttgcaggg atccaaggtc ctgtcttctg tgggcaccag 48600 

gcatatatgt ggtgtacata catgtataca ctcatataca taaaataaaa agttttaaaa 48660 

aggaggctgg gtttgtagcg cagaggtaga ggtaaaaaga ctctagcttg tttaatgttg 48720 

acatgaaaaa aaaaagacat ttagattcct gcatcacacc atatccaaaa attaactcaa 48780 

tgtgaatcat aagctctgaa agtaagaata agcctagtat gcactgtaag gctctgggtt 48840 

cactccccag cactgcaaaa gatcatgaaa ccagaaatgc agatcctctg aaccacagca 48900 

tgggaatgta actcagccga tgcagtgctc acctgtcgta tacagagcac aggataaatt 48960 

gattgtggtg gtgcatacct ataagctcac tacgtggaaa gtagaggcag gacgaccaaa 49020 

ggttcagtga catccttggt cacatagaga atttgaggcc agtctggtct gctggtctat 49080 

ttggaatgct gtctcaataa ataaaagaaa gaaagaaaaa gaaaagaaga agtcctatga 49140 

ttgtcttaac ctctgacctc tgtgttcatc aagtctcctc ctcaggaact cactggtcat 49200 

cttgtgaaaa cctaccccag agtctctgtt cagaggaccc aggctccagc tgtggttacc 49260 

acataggatt tttatactag aaaaataaaa tgaataagta tgtatttttt aaaaaggtgc 49320 

agagctggat atggtggtgt ctagttatag catccagaac tgagacagga tagccatgag 49380 

gttgagaaca gctagactat acggtctcaa caaacaaaag taagggatct gagtagatga 49440 

ggttttaatt tttttctttg tgtttgttac ctaacgtgta tggttgtttt gaatacatgc 49500 

atgtctgtgt atcacttgtg tgcctgaaac ccaaggaagc cagaggaggg catcgggtcc 49560 

cccggaagta ttattacaga aggttgtgag cagccatgtg ggtgctggga atcaaatctg 49620 

aaagagccac ctcgggctgg agagatggct cagtggttaa gagcactcaa tggctgctct 49680 

tccagaggtt tggagatcaa atcccagcaa ctacatggtg gctcacaacc atatgtaatg 49740 

ggatccgatg ccctcttctg gtgtgtctga agacagctaa agtgtactca aataaataga 49800 

tcaaaaaaga aaaaagaaac agccacctct ccactctccc tttttaaaat cctcttgcct 49860 

ctgtccctta atgttaataa cacaggtata tgatactatg ccttgtttat gaatagaaaa 49920 

tacacgtgct aaagcaagtg tgaaccttaa atacattatg ctgagtaaaa ggagtgagtt 49980 

gcacacaaga cttttctgct caagagtatc tgtatgaagt attgaacatg tgaactctga 50040 

aatcgggagc tgaggaagat atggggagtt ctaatggcta caacatttct ttttggaatg 50100 

atgaggatgt tctagaactc aaaaatggtg ataactcagc atatatacta aaactcattg 50160 

aattgtacac tttaaatgaa tgcaataaaa cttgtctcag taatgtggtt tagaagatgt 50220 

acagacatgt gtgtgtgtgt gttaaaacat ttcttggcat ggcaataaaa atacagtttt 50280 

agccaggtgg ttgtggctca aaaaataatg ataataacaa taataaaaat aatgaaaaca 50340 

gaggctggag agatggctca gcggttaaga acactgactg ctcttccaga ggtcctgagt 50400 

tcagttccca gtaaccacat ggtggttcac agacatctgt aatgggatct gatgccctct 50460 

tctgatgtgt gtctggaaac agctacagtg aaagtcattg caaggacttt acaatagtga 50520 

ccatgataac attgaagcta gacttgctac tactgctgag tgtgtctgct ggctctttct 50580 

aaggagtaat gttagctttt tgtcctaaat ttgtttcctt cctttcctct ctccctctgc 50640 

tgttttttct tacccctctt ttactttgct ttcccctctc atctcctctc ttaacagagt 50700 

tgtcctatgc agcccaaatg ccatcttcct gcctcagcct ccccagtgtt gaaaaatact 50760 

ctttccacag gttatgttag gagactggag tctgctcagt cggggaggga gcctgggtca 50820 

agttctgagc tcaattcctt ttctttcttt ctttctctct ttctttcttt ctttctctct 50880 

ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt taagacaggg 50940 

tttctttgta taccctggct gtcctggaac tcactttgta gctggcctgg aattcagaaa 51000 

tctgcctgcc tctgcctccc aagtgctggg attaaaggtg tgcaccacca ctgcccagcc 51060 

ctgggctcaa ttcttaacat tgtggagaga aaagtattgt agctgttctg gccacctgga 51120 

attactttgt ttctgatctt ttgctgcagt caaatccttc tcatccatct ttcctcgtca 51180 

ggctataata tagactctcc ttgcaatact tggaaatgct ctacagtcag ctacatcctc 51240 

agtcctgctc ctatattttt tcctaagctt ccttctaagg tctttattgg tttatgattt 51300 

acacagaaca tttttttttc ttgtctatag catgcgttag agtgatcgtt gccagataga 51360 

ggaaagagaa atgagagaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 51420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt 51480 

cagctactga ttcctcctcc tccctcctcc ttcctccctc ctccccagcc tcatgctctg 51540 

ctcatcttgg acttctgcgc atgtcctcag cccagacctt ctgctcttgc ttctcctctc 51600 

cccagcagcc ccccagttct cttcctgaaa cttctgaggt actctccatc acctcctttg 51660 

gctcctgctc tgattggtgt cacctgctgg ataggcttgc tcctgactcc actgttcgtg 51720 

tctcaattag ggaccctcac cctctgatat accacacatt tccctagtgt ctccacctcc 51780 

cacccccacc ctatacgcac atacacactt agctgcatca ggatcctaca ccagggactt 51840 

cttacccttc taatcctccc caccggacac tgcccaggga cactggggct ccagagggct 51900 

attgccacac ggacacacag gagatctcat caaggagatg tgcctacccc agagggtagc 51960 

tctcaccatt cacaagcaca ccacttctgc ctccagcttc tactctctcg caggaagtag 52020 

ccagcccggt gccaagtatc cccaactaca tccccaaaat tctcagacac tgccagcctc 52080 

cagctgtcag cctggccccg gctggcgggc gcctgctcct ggcatagcga ctagggtgta 52140 

attagaaacc cgctagctcc ctaattgcca gttctgagct gtccttgtta ccggctgccc 52200 
gaggcacaca tagaggaaaa ggctgagagc tgagccaggc tggcatggag gtagccctag 52260 

tagacctaga gaggactggc atgtggccag ggaccaaacg tggcacagag agggctcagt 52320 

gcaatctgcc ccgtgggtgc ctcccagcca catccatttg cccagaactg tgacgtcaaa 52380 

ccagcccggc ccattcattc tttattcagg tggcataaaa atcactacaa aaactttaca 52440 

aaagagtctt gggagctaaa gggtcccttc cttgcctcag tccccaagat tcctggcagg 52500 

ggaggacaag agagagaaga aggaggaaga ctcctggcag tgttggcatc tccaaatacc 52560 

agaggggtga cttgggtgac aggacacagg ttggggacct gaatgtcttc agcaagggac 52620 

actcttgtag ggtaggtcag cctccaacca tgaagtataa caccaaggcc agtctaagct 52680 

tgggagacca acacttgtct ctccttttcc cacccagggt gtctggaata tgtctaaaga 52740 

tggcctctcc agcctctgct tacaaatgtg gagggaccct aagttaggga cttgcctaac 52800 

ctacctctag ccaaaactgt gtccacaagt gccagcccac aaaagatcac cccctgagcc 52860 

ccttgggaag aaatgaagat tccccatgcc tgccttcctc caggccccac cccacctgct 52920 

gcaagagaac agcttctaca ctggtgatgg tccttccggt cccaccctat cccacaaagc 52980 

tggttagaaa gagtcacagg agctgagagg ctgatccagg tggggactca ggatgctgct 53040 

gcccagggcc cctcctcact tgggggagct gaactggggg tagtcttcct ccatgcgggg 53100 

tgcaagtttc aagtcaggac caaaggtctt gcctccatgg aagtcagctt tgtcattctg 53160 

gcctatgagc ctgttgtcag gggaatctcg ctgttcctgg agctggggca gcgcgctggg 53220 

gttagggttc ctcacactgc ccacaaagag gggcacgcct atggtgtcct ccatgatgaa 53280 

gaagaggaag ggtcggttca cagtgaagga ggagagggac attcgattca tggctacgct 53340 

ggtagctgcg gctgcctcca caccagcctc gctgagctcc atggtagact gatgttgcac 53400 

gctagacacc accagattct gctcagagat cccacgaagg tctgggccct ggaacaattc 53460 

ctgcaggcct gcccagaaca gcagatgact ggtcagtgct gccccaaggc tatgtggatc 53520 

tgtctagcat cctggctaaa gggaacactt gaacccagcg gttgattgga atctgttaga 53580 

cctcagtcta gacaacactt ctagaaacct tttttttttt tttttttttt ttttaaatca 53640 

ggatctgcgc taggtacagg acagaaagtc tagaggagca tatcaaatgc tcccatccag 53700 

gaagcagggc cacctctggc tcaggcacac tggcagctcc cgtactctgc ccagaccacc 53760 

taggggcacc ctatccccaa gctccttacc cagttggctg agggtggcca ccaggtccag 53820 

ctgctgttgc agatggagtt taggcagcca caccttggtg ggcctctcct gcagcgaggg 53880 

atggtacaga gtatcccagg tcaggttggc tagtacctcg gacacgttcc actcaaaata 53940 

agtgggcatc acgaccacaa agctcatgtt gttcttaaag gggaaatgag ccacctacag 54000 

ataagaaaag gagagaacat gaggaccaga cagcacctgg acctgtctgg agtctgggcc 54060 

aaaattactt ctgtactttt gagacaagag ccagaaattc agggttagca tgctttcact 54120 

taactggtga agtggaataa taccacttac ccctttgcaa ggtgacatgg gaccaaatga 54180 

gataatgctt ttacacctct ctgtgtgcac acataagcat atatgtttgt atcggtgtga 54240 

gtgtgtttgc tcatgggtat atggagtcag aagtaggtaa acatcagtcg tcttcctaca 54300 

ttgctctcca cttttttttt tttttttttg gtgttgccat ctttttgttg ttgttatttc 54360 

aagacaggct ttctctgtgt agccctggct gtcctggaac tcactctgta aatcaggctg 54420 

gcctcgaact tgcagagacc cacctgcctc tgcctcctga gtgctgggat ctaagatgtg 54480 

tgtaactaca catagctccc tcttttttgg acacagggtc tcatggatcc caagctggct 54540 

ttgaaatgac tgtttggggc tggagagatg gctcagcggc taagaacact gactgctctt 54600 

ccaaaggtcc tgagttcaaa tcccagcaac cacatggtgg ctcacaacca tccgtaacaa 54660 

gatctgactc cctcttctgg agtgtctgaa gacagctaga gtgtacttac atgtaataaa 54720 

taaattaatc ttttttaaaa agagaaagaa atgatggcta catacttctc tctcgtctct 54780 

ctgccccaag tgctgggatt acagagctgt acaacaagcc caagtttgtt gtgttttaga 54840 

catgctaatg tatcccaggc tgtcctcaga ctctctatgt aattcagaac gaccttgaac 54900 

ttcttttaag gtttattttt atcttatgtg tatgggtatt ttgcctgagc atttgtctgt 54960 

gtaccgtgtc cttgcagtac cctcacagtc cagaggaggg caccatttcc ccctgaactg 55020 

gttgtgagct gcatggtggg tgctgggaat caaaccctgg tcctctgcaa gagaagccag 55080 

taagtactct taactgctga gccacttctc caccttgagc ttttcttcct cctatctcga 55140 

tctaaaagta ctagggatgg cggatgtgcg ttcatgtgcc tggtttatgt gttgctaagg 55200 

gttgaacaaa gggctttgtg catgccaggc aagcactcaa caactgagct acacatcccg 55260 

acagactttg actcttctag tagtagtgtc tccactacag cctgagttct ctatctgctg 55320 

tcagcaagct gtacaaacaa gctatgggcc ttcctgtcct tgcctctcag ttctctccgc 55380 

aggtggggct actggctttc aaaatgaccc atagaggagc cacagcaaac agtaggaagc 55440 

ttgcccctcg tctttcaccc tctcccagag agtcagctat aattcgagtt tttttttcct 55500 

ctctctctct ttaaacagga tctggttatg tggccctaac tatcttcaac ttcagtcttc 55560 

ctgcttcaac cttctgagtg ctgggattat ggtgtaagcc accacactca gctcacacaa 55620 

cctttttttt tttttttttt tttaaagaat ccatgcagtt aggacagcat ggaaatgacc 55680 

aggctcaggc ctccctgggt accagcataa tgcctgcagg cgggtcctct gccagtgggg 55740 

ggatggaaag atggagccag aggatctttc ctctctgaac ctcaatgtcc cacagtgaga 55800 

cactcatgtc cactgggaga tactgtagta ttcaaggaag aagcaacagg aaggtgagag 55860 

ctaagtggag ctgagcaggc tcgtatcctc tcaccacggg ctacagagaa gtctggctgc 55920 

cccctccaca tggctcctcc ctgcagaact ggcaatgctg ggcccggctt gcccagtcaa 55980 

actaaccaac agaatggatg agcatgtgtg gtgccacaca cctgggaccc cagcactcag 56040 
acagctgggg cagaagggtc atgagtccaa agcgaacttg tgtaacattg tcagaccctc 56100 

gaacaaacaa aactagcccg tcctgttatc tcagccacag atgatgggcc caaggatcag 56160 

tactctagcc aaggagtcac ggttaggcta gaagcaaggg aagccttagc tgagacagct 56220 

tggcacggag cttcatccaa tcagaatgtt cagagcaata agctttgaaa cccgacttcc 56280 

atctatgaag cactgtgtgg gaactcctct cttcccttac gagcagggcc ctggtcctct 56340 

tgggctccgc taaaacccca gcacagagaa cagttacctg gcacgtgaca aaaactcaat 56400 

atattttctt tgaggagatg aacctcaaag aagctgtgtc ctggatagac acagcataat 56460 

aaacccttca ggagctacct acccagggac cagactttac ctcccagtac caggcctcgt 56520 

ttgccagcca aaggcaaagt ccagactgac ctgtatctca ggttgctcca gcaggaacca 56580 

tcgaagagga tatgacaccg cgtgcatcat gtccaccgac actgtgaacc gctcatccag 56640 

gtggaagaaa tctttctggg tgaggctcgg gtcaaacttg gtcctccaga aacctgcagc 56700 

caggcagagg gcaggagcca tgtaacataa aatcagcctn ctgcctgtct tgcctagaac 56760 

ctatnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 56820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnaaccaa ggcaggtctt 56880 

ggaaaaagga atcttaaatt agaagatgcc ttgataagat tggcatgtag gtatgtctca 56940 

ctaatgattg atgtggaaag tcacgaggga tggtgtcacc ctgggcagat ggcctggggt 57000 

atataaaaac acaggctgaa caaaccacaa agcagtagtc ctcaatggct tctgctttag 57060 

tttctgtctc aggttcctac cttgacttcc ctcagtgaag gcatgtcaca tgagagttgt 57120 

aagaggaaat aaaccctttc ctccccacat agtttttggt tatgatgtta tatgtcaaca 57180 

acagaaacta taactaatat agttggtttt ctttttttgt ttgttttgtt ttgttttgag 57240 

acagggtttc tctgtatggc cctggctgtc ctggaactca ctttgtagac caggctggcc 57300 

tcgaactcag aaatccacct gcctctgcct ctgcctccca agtggtggga ttaaaggcat 57360 

gcgccaccat tgcctggctg gttttctttt ttttttaata catttataat gcattttaga 57420 

tttaaaaaaa aaaatggcca tggcatataa tataaaaaga agtgcttaca aatcaccatg 57480 

tgcccttgcc ataaattatg taaaaatttc catatggaca tcagtctcaa gcttacaatc 57540 

tcagcactca tgagcctgag gcagaggcag gaggatggtg agctcaaggc cagcttagtc 57600 

tacataacaa gatcctgtcc aaataataac aacagtaata atttcataca tagaactaga 57660 

aggggccact gcaaagacag tatgacaaaa ccactggccc tgcctaattg tattttaaat 57720 

aactgtcctc ctctctgtaa ttttcagttt ctaattttta cataactacc atgtattctt 57780 

tttgtaattt taattagttt tttaataata gaaacaagct aagtgctaag aatattttca 57840 

tatgaacatt ttcaaggcac ttgatacata cctcagattt gccctccagg tgagcagtac 57900 

caattacgtg ccaccagcaa tgttagcttc cttttttccc taccatctga ttctgtttca 57960 

gtctattcgt agttctgatc ttgttatatc cctttttatt gtttccctgg gttccaacac 58020 

ctcccagttg agtgttctca ttgaatttca ttagcagctg tttcattaat ggcacagaag 58080 

aaggattaca gtgttaacta ggatagactt tgacaaagaa ctatgagaac atatcttatt 58140 

atctttgcat aaattctttt taatcaaagt tcctcaaaag cctctctctg ttcccatctc 58200 

agggagtagg tctggccact gatgagtgtc caggccacag tacaggtgtg cgtggttctg 58260 

tccctgtggg aagggcacat ctgtgttgta acaggattcc tgtcttaaca agccttgctc 58320 

aggctctaag tggtcctgag ctagctaact gcccttggct ttcccttgat taccagataa 58380 

ctattcactc ttctcatttt gcagagcact taccaggtag ctatgtcctg gaagtacgaa 58440 

tgagtccttc tattgttttt cttttactta aatcccattt gaaatgcgcc agggacactt 58500 

caatccaagg tacacttttg ctaaagaatc actcattttt atatgcaaaa tgtcacctat 58560 

taactgcagc tgatatggta catacatatt ctctcttcct attatccact aataggtgac 58620 

taatgcgaaa tattgagtaa tttttaaaaa tcaatactca attttttaga aataattaga 58680 

gagacattca actctgacac cagcacccta ctcagttcct gagccttcct ctgccggagg 58740 

agaatctata aataactcac gaagctgaca ttactcactg tgttgcagtc atttttttct 58800 

gagaaaattt tagcaactgt tctaatagag cctgccagtt atcagtagtt gagaatgcaa 58860 

gtcaactttt aattatgcag acgctgatta ttcagacgac aaattgttgg tgcctgcacg 58920 

gctccttcct gctgcctacc tttaaccgtt ctcagtgctc attagcacat gttccagaag 58980 

gtaggctttg gaggggcgga caggcactca aaccagctaa gcacttagag aagctctgat 59040 

gaaagatgtt aatgcagttt gtagaattat tgactaaaat tgagtcattt ggattccctg 59100 

tgaattgtat ttacatgccc tgtccctgtc ccccatagca acagataata ggattgtctg 59160 

cagagagaca acatagttct tatatttaat tttttccttt gtcgaacatt ttcacatgat 59220 

ggttcgtggt gtttcctttg ttcattacat ttgtatccag actagttact tctgataagc 59280 

ggttagttag gattcctggc acgcggacag tgacaccaca gttgtctgat cgtttcccac 59340 

ttttttacaa aaccgtttgc ctttaagagt cagtgttttg cacatttcac ccagattatt 59400 

ggaaatatta tttccctcct gcttaaaccg aagctgtgat cataatttaa gcctttctag 59460 

gtagccgatc ttacatgtat catacctatt tctggcatat gtttgtctat tacaaagacc 59520 

tcgtaggtat gcagttagaa gcctctagtt aaatgaaatg ttgcgtgtgt gatgaacctg 59580 

gagtggggat ggccttttgt gtgccccaag gctgttgtgt ttcacacagt tgttttctgc 59640 

ctcctctggt ctatcactat cctgccactg ccagaaaacc ctgctgtgtg ttccccgcgt 59700 

ggaggatctc tgcttctgaa cttctttggc ctgagaaact ccataaccaa atcagttagc 59760 

attttgttta aagagcaggt aggctgttag agcttgggtc ttacatgtct cccaggtcca 59820 

cttgccagcg ccttgaccac tgttaacttt tgttaaccaa ctcatctttt gctgcctgtt 59880 
ttttgggggg tttttttggt tttgtttaag ccaagatcag ttatatggcc caggctgagc 59940 

ctctcttccc agcctctcaa atgttagaat tacaagcatg catccctcag catacctttc 60000 

ctttgctttt tttaaaatag agttttgcca tagcaacaga aatctaacct aactaagcat 60060 

agccgtgcac atggtatgag gaactcacat atgtgtgaat ggaagttcat agagaccggc 60120 

atcactgcct agaggcccct ttcttccttc cttgcagttg tcgtgctagc tgactgtact 60180 

acaaaagagg ttgtctgagg cataagacta ccttcaataa aacatgcaca gacagtttgc 60240 

ttctctgaga tttcagagca gtgactacct tcaataaaac atggacagac ggtttgctta 60300 

cctgagactg cagagcagtt tccaaaaatt ttagacaaag ggtaggatga agaaggctgc 60360 

ggggttttgc acacacttaa ggtgcgtaag taaataaact gagctacact gacaggatgc 60420 

tcgttctagt agccaaccaa agagcagttg aaccaaagca cctagacttc aaacatcgtg 60480 

gggagataat cttaggagtg ctatgcttct gcgtcctaca agtattatga aactgtctag 60540 

aaagcacccc actggtaatc cctttttgat tatttttttt ataaattcta gtcttggggt 60600 

tttgagtggc acacagacat aatggttagg cttcggtgtg tgctcattca ctttgcttcc 60660 

tggggaccag agtttgcgat gagtcatgtt ccatctgatt tctgtcggat ccggctgcag 60720 

agccatgact cagatgggct tcaggcccag ctgctcagtt catcttctgg ggaatagatg 60780 

acaaggacgg gacaaatgtc ctgacgcaca tttccttctg ttcttgcact tccagggtct 60840 

aacgagagca tcattaccaa cagcaggcag atacgccttg ccacaggcat cttccctgtt 60900 

gtcagcctcc tgaaccactc ctgcaggccc aacaccagtg tgtccttcac tggcactgtc 60960 

gccaccgtcc gggcagcaca gaggatcgca aaaggacagg agattctgca ctgctatggt 61020 

gagccagcct ttctttccac taccctgctg tgcctcacac ctcacatgaa aaggataagg 61080 

ggacaggaat cagcagatat gggcccagtg cctctactca tcctctgagt ctttcctgga 61140 

aagggcaatg catccttggg ccaataaaaa aggtcttctg gctgtaataa aaaagcccgt 61200 

tgagggcagt gagccatatc cctccatgcc ttgtagacag cctatcctga aaatgagcga 61260 

ggagcacttt cttggcttct ttcttcctgc cccagcagct tggaaacgta tccactttca 61320 

cccgtgtttt gttgtttttt ctgagatgat agggcagagt acccaacctc atataggcta 61380 

ggctagtgtc tatcactgag ccaggacccc aacccagcac caccatgcca gtcacgtgat 61440 

gactaggcca gcccctcggt agagtaggca ttgactctct tggtgtgact aggaactgtg 61500 

ggtaatctct ctccagggcc tcacgagagc cggatgggcg ttgctgagag gcagcagagg 61560 

ctgagttctc agtacttctt tgactgccgc tgtggggcct gtcacgctga gacactgaga 61620 

gcagctgcag ctcccagatg ggaagccttc tgttgtaaga cttgcagagc gctcatgcag 61680 

gtaaatctct gctgttccca ggggcagggc tccagctaaa ggttgtcagt cgccaggaga 61740 

accattcctg cttcccttct tgtaactcct ccctacatgt cgcccggtcc tgcagaaaac 61800 

acaggttgta tttcctaata ttttccctat aagtgacaca aaatcttaaa ttacacaaag 61860 

ggaccaaaaa aaaaaaaaaa aaaaaagccc tagaaattta cttgctcaaa taagtcatca 61920 

aaagttgtgc atcaggccta gcacttgggt actggtaacc ctagcactca ggaggctgag 61980 

gaagaaggat ctcaagtcgg aggccagtct caagtgacac cccatctaag agatcaccat 62040 

tccaaggagc tatttcagag atggtttaat ctggggaccc agattgtgga ttttctgtct 62100 

gttcaattcc atctctctgt gctggcctca tcagacacac tctgtagtaa ctgtgggaaa 62160 

atccgaccca catagttttc cctcagcctt tgacccagag ggaagagcca cagtggagag 62220 

catgagagca gacccttggg tgctactgcc aggtaatggt gtagacactg gagtcttcaa 62280 

cattcatgcc ccaatgcaaa atggtctcca caccagagca tggcattctc attagaaata 62340 

agtaaatgga attggctgtg ttgaaaattg taaagccaag ggtcaagaat gaagccttcc 62400 

ccagcatgtt ttgttttgtt ttgtgtttta ggcagcgtct ctctgtgtag ccttggctcc 62460 

tgccctctgc tacctctccc aggtgtgcca ccatgctggg cctaagcgcc ctgtgcatta 62520 

gtgctccctc gatcctgctc actcttgaga cagtcttcct tctactctgt atccccagat 62580 

aacctagagt tcacttcaga gcccaggctg gcctcaaact tgagatcctc gtgtcccagc 62640 

ttctcaaatg cagtgatatt tacaggccta cacctggctt tccctgatag attcctagta 62700 

agatgattat cctttgagcc atatctctct tctgcttctt cctctcttcc tgcagggttg 62760 

atctagaatt tattctaaag ctgactggcc tcagaattgc catccttctg cctttagnnn 62820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 62880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaca tagtgtcaac tttcaaattc 62940 

tgccttaaga gttctttgtt tatgggaatt tatgggaatg ttccacagaa cccatccagc 63000 

ggagttctgg ctgttgtttt ttaatcttta ttcatcttgc gtgtgtgtgt gtgtgtgtgt 63060 

gtgtgtatgc gcgcgtgctc aacttgcaaa attgcaaaat tcagtctcct ctttccaccc 63120 

tgtaggtcct ggggatcaga ctctgttagg cttggtggta ggtgctttac tgagccatct 63180 

tacaggcccc ccatggacaa ctttttcttt gaaaacctgt ttctggcttg ggtgtgatag 63240 

ctcacacctg tgaccctacc accactcatg aggaagaggt aggaggacta acagaattgg 63300 

aagccagcct ggactacaca gtgagtaaag gctatctata tactcaccac atggcaagac 63360 

cccgttttaa aacactgggc aaggtgaaac aaaagtcaat taatttcaca taaagtcaat 63420 

agcttcatta acggcctagt tatctttaaa actgtatgca ggttagtact tggtttcaat 63480 

tttattactt tttctctgga acatttaaaa gtactttagg ggctggagag tcagttaaga 63540 

acagtggctg ctgccaaagg actggagttc actcccaagc acccaggtgg caatcacaac 63600 

tgtctgtcat ctaattctag gggatctgac accctcacag actcacaggc agtggaacac 63660 

caatgtacat aaaataataa ttaaaaaaat gaaataaaat accaggcaag gtggcacacg 63720 
cctttaaccc cagcactcag gaggcagagg caggcagatt tctgaattcg aaggcagcct 63780 

ggtctacaga gtgagttcca ggacagccag ggctatacag agaaaccctg tctcaaaaaa 63840 

aaaaaaaaaa aaggacttta aattgggctg gagagatgga ttaaaagcat tggctgctct 63900 

tcccagaggt cctgggttca attcccagca ctcaaatggt ggctcacaac tgtctatatc 63960 

aacgcaatct aacaccctct tcaggcatgc aggttacatg tagacaaaac atccatatgc 64020 

ataaaataca taagtaaatg agtcttttaa tgtatactag aagctgggtg gtggtgcatg 64080 

cctttaatcc cagcacttgg gaggcagagg caggtggatc tctgagttgg aggccagcct 64140 

ggtctgagta aatagagcct tgtacttcta cttatcacta cagttacatt ttataacttt 64200 

gggccctagt gcttccattt tccactgttt gcttaaccac tggggcctga agcttttgtg 64260 

ctgacacttt tgttcgctaa tcatcaggca accaatggtc tctacactcc atcaccatca 64320 

acacaaacaa aacaaaacac aacactacgg atcctggcat ggtggaacat ctttagcccc 64380 

agtacgtggg cttgagttca aggccggcct ggtctacata gcaagttcta ggatagtagg 64440 

gatagtcttt aaaacaaaac actattttat ttatgaacaa aacatgtaaa gaaagaaaaa 64500 

aaactgcaaa tttatctatg aatgaagtct aagtaatact tcaatattgg aaatagcttt 64560 

ctaaaatatt tttatttaaa gaaaactcag caaattattc aaacaacctt ataaacgttc 64620 

gttataaaag taaagaatta tttgcaattg ccttaagggt ccaaggtggc agcctcttaa 64680 

aattcagaac aatccaagct tcacattcca gttcaacatt tctacagccc taacgtattc 64740 

aaatacctcc attctgacaa ctgtttcccc tcttcttttc ttctaagctg cttagatgtc 64800 

tgtcccaggc ttttcatgat tttagtcatt cacacaacta gcaaacatta tctagggact 64860 

aaaacttgcc agatactggg atatcaccct aaagggggac tgaaagtagc tgcaggctac 64920 

agtctctaca atctcctgaa tgaaatacaa agtagctaat atttaccaaa taaacatgta 64980 

cacctgtgat gattgctagc tgtactagca gaagctaaac actaaatcta gaaactcagt 65040 

cctccaacta gccccttgct cggcttcagc ctcattttta caaacaaggg aaagagtttg 65100 

gaatgttgcc caaagccata cataagtgaa caaaaaggag ttggagtctc caaatgcatg 65160 

gatttgggct agttactttg ccaaccaact cagtaacaac tgagctgaac aggaacactg 65220 

tggtagcaaa agaaactgga actatcaatg gcctctagag caaaaatata tttaaaaaga 65280 

aaaaaacaaa caaggcctgg caaggagact gtgagaagag tgtgctgact gaaattgact 65340 

agttcagcca acaaaagact attccagggc tggtgagatg gctcagtggg taagagcacc 65400 

cgactgctct tccgaaggtc aggagttcaa atcccagcaa ccacatggtg gctcacaacc 65460 

atccgtaaca agatctgact ccctcttctg gagtgtatga agacagctac agtgtactta 65520 

catataatca ataaataaat ctttaaaaaa aaaaaagact attccagtgg ggatggaaaa 65580 

gttaagtgtg gagttaaaat atacttcaac tggtgatgga ctaggtgtcc agagtcgggc 65640 

aaaaggatgc tctgtggtag aggtgcctgc tgtgtaagcc cagctacctg agctcaatcc 65700 

acagaatcca cagcggagtg ggaagagaaa caacgtccca gagttgtcct ctggcatccg 65760 

acgcacattc gccatcccca agatgtcata catatgtgta catactacac actggcgcac 65820 

gcgcacacac actctttttt aaaattcaga cttagaggga cataaaggat ttgctctgat 65880 

atatgttcaa ttgaaaatga ctttgaagat agagggcaga tcgaaggaag ctcagcagga 65940 

aagaattaat aacatgcagg tgaagggcta taaactagtc tgcagagggc cttggctcga 66000 

caaaaaaatc tatggggttt gccggtaaaa taaggaaaaa gttgtcaaca tgaaacacag 66060 

aacactagca agagaggagt gttagcagaa agaagccaac aagctcaaac aattaggtcg 66120 

gctgaaaaat tttaaaatgt cttctgattt ggctactggg aagccactgg tgacttcggt 66180 

cagcgttttc tctctcgtga ccagagagat gtctagtagc aataatgagt taggaggatg 66240 

taaaagaagt aaaacagccg aaaacaagtc caaaaagttt ggggtgatgg agaaagggag 66300 

gaaacagagg ccgccgaaga tagacagcgg catgtttatt tgtcttgttt tcttagatgt 66360 

aaacaaacta aaaaaactcg tgagttcttc tgccagtacc gggttgcctc cagcatcctc 66420 

tgatggtctt agagaccccg ggatgctccc ccgcggccgt ataatttcct ccctgacgct 66480 

ctcccgatcg acagcggctc cctccccggg tcctctttgc accgctccaa ggccgcgctg 66540 

ctagggccat cgagcccgct cagggtcgtc tccttacctc gatggccccc tcgctcaggt 66600 

gtcccaccat ggctgcaccg ctaactcccg cgctcgcgct cttgcaccgc ctgagcttct 66660 

ctgccggggt cccgcgggct gctcaacgat tggctagagc aactgtgcgt gccgatccgc 66720 

ccccagcgtg agcgcggtgc gaggggcggg cctagacgcc gatagccacc gcattggcta 66780 

ccgcgcggca ggcagagcac gtgactcttc cgaggccggg ttcgaggcct agtggcggga 66840 

tggcgggacg tgagggcggg gcgctgggtc gcagtgcgcc tgtgtcagcg cggtgctact 66900 

gagttgttcc cccgccagct gtcggaactt tgcccgccca gtcctttggc ggacagacag 66960 

aatggcaacc cagggaacag tcggagctct cccctggtaa ctgctgctaa atatagtcaa 67020 

agcagtgacc tgggtacttc ttcacgcagt gcgtgcccgg cgccggtgcc aggcccagag 67080 

cttggcactg tgggataaac aaggtaaatc agactcagtc tccgccctct tgagttccac 67140 

ctgagagttg tggccgcaag gaacccagcc tcaaggatgg tagacgcgat atgggccaca 67200 

catgtggagc tccagagtgg gggtcaaaaa tcaatcaggc tttcgagagg cgatgcggtt 67260 

tgaactgagt taaagtgtgt gtagaaattt gtcaggtgga ttccagtgag gatagtgatg 67320 

ttcctaaaag cccaaatggc ctatgcaaaa gtattggaga gcctggcgtg ctggctggct 67380 

ctgatctgtt tgtaatccca gcctttggga tgtagaagca gcaaaagttc aaggtcaccc 67440 

ttaacaccgt tgagttcgag gtcaacctga actaaatgag accctgaaaa atcaaaattt 67500 

gggacccagg cgtggtggca ttcgaggtaa aagcaggcag atctctgagt tcgaagccag 67560 
ccaggctaac ataagatccg gtctcaaaaa aaaaaagtaa taaaaataaa aagggagaga 67620

ggctatatga actgaaagaa agacctggag atcaaaacag aaaactgagc cgtctaagaa 67680 

atgaaaatat ttaacttcat agttgctgga gtaagaagtc tggaaaactt tgggcaacta 67740 

aggtaaacag gtctagaaag actggaatag tagccatcta ctggtatttt gatctctgtt 67800 

tgtacaacca caacctacta tagtttctca aacagttcca aagaatatgt ctgggtgaat 67860 

tggtaccaca ccacagatta actctccttc agcatatcaa cagctataga aaaccccaga 67920 

agaaatgatt ttggttgcgt gtcacttggt aggatgaaat ctcgattttc tagaactatg 67980 

cattaataga aagctgaatc ttcatgttct gactttacag agctgcggca gcatggatct 68040 

accggtggat gaatggaagt cctacctact taagaagtgg gcttcactcc cgaagtctgt 68100 

gcaggacaca atttctacag cagagacttt gagcgacatc ttccttcctt cttcttccct 68160 

tcttcagtaa gtgaatggaa acttcaggga aattttggtc tggaaaatgt tctgccttgt 68220 

catttggtct gaatatctct tttttatagg agagagtagc tttatattct ttatagtatg 68280 

gggcatttag cagttactgt tggttttcac gtttctccct agtctgtgat tactagaatg 68340 

ggtaggcact aactgctttc ctcttttggc atgtgttata cttaaggaat gtagtatctt 68400 

gctgtcgtcc cagtgctgtc actcatagga tctggtgcag gttgtgtagc tgcccctaga 68460 

agctcattca gtcctaatgg ggagaaagaa ccctggcact tggttagttg agacccanaa 68520 

cttctcaagt tctnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 68580 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnntttagtt 68640

tccaagtgcc actttactgc aatgtgtcac cacatccaga gttctgtgtt tgtttatttg 68700 

tctgtttttg agacagggtt tctctgtgta gccctggctg tcctggaact cactctgtag 68760 

accaggctac cctcaaattc actgagatct gcctgcctct gcctccagag tgctgctgta 68820 

cactaccacc acccatccag ctttctatat tggttttcct atggcgtttt aaaatagcca 68880 

ttatacgtgt gtttatcatc taaagtcctg gtcccaaaag gagatgagag gggctgctaa 68940 

ggtgaaaagg attacaaacg cttatcaatt ctgttcaaaa attaaacctc agagtgggat 69000 

tagctgcttc tttcattaga attgctatca gaattcactc aggccttgtt tgcgtgtgtt 69060 

attgaagaag tctcttcctc atcaggtcag tgactcctta gctcaagtac atgcaatatg 69120 

cagtattgat aactgttctc gcttaggaat aaaaatagaa ctgacttcca cagggaaatg 69180

atgtgctgag ctgtagcaac gaatcttgca caaactctgt cagcagggac cagctagtct 69240 

ctcgcctgca ggaccttcag caacaggtct gcatggccca gaagcttctc cgaaccggta 69300 

aaccaggtga gattggctcc ctcgccctag gcctcagccc ttcccttgtt tattttggta 69360 

tcaccttgcc ttactgagca gtcctcaata aatgactgag gacttgaatt taattatccc 69420 

agcaccagcc acaagatggc tatgtaggcc agtgagacca gactgtgacc agctgttact 69480 

ctggtgccct tgaaagtctt cctgatggtt taagctgtgt ctgctgcgcc agatagttct 69540 

agcagctcga gcaccagaaa ggctgtctga cttccatggg ctttgtgtgg ctccagaggt 69600 

ccaatgccat catctgattc ccagcttaag gacctaagct ccgagaaggt tgctctgccc 69660 

tcagcagcag cagcaagtcc tgagtgctgc ctgggctcgt ggtgtgactc aggagtagag 69720 

ctcggtagct agcctgagct gagagctgag agaaagaaag gactcctctc ttttcagaaa 69780 

gggatttgca gaactcgatg ttagaccctg acatggtagg aatctgtttt gactattcta 69840 

gcctagattc tgaagttgac ctttagccta gagtcaagaa aactaatgat tacaggagga 69900 

atgtagagtt ggttgttaaa tgttggttgg aaaatggatg ttagaagccc agggtaaatg 69960 

tgaggaagcc tcatctaaca cctcttttac tgaaagagaa aacataagca accaacagct 70020 

tccctggaat gcccggctgt tgactccgtg agataaagag gcattttcac tttgacctaa 70080 

ccgatagaga ccttgcaacg tggtctctcg tgtccaggac tagatctgta tctgttgtga 70140 

ggcatttttc ctttgaatcc atagagcaag ccattcagca gttgttgcgg tgccgggagg 70200 

ctgctgagag cttcttgtca gcagagcaca ccgtactggg ggaaattgaa gatggcctgg 70260 

cccaggccca tgctacctta ggtatgctac cttaggtata gccggagttc tccttccctg 70320 

ccgtgtgttc agtgcggccc ttgccttgtc tgtttggttc tctcttgcca tctgaattga 70380 

cgctcttctc cctcccattc tgcattcctt gcccccagag ccttaggcta atggtgtttc 70440 

ttttccggaa tgagacattt ctcttctcac agggaactgg ctaaagtctg ctgcccatgt 70500 

acagaagagt ctccaggtgg ttgaaactcg ccatgggcca tccagtgttg aaattggcca 70560 

tgagctcttc aaactggccc aagtcctatt caatgggtag gcctttcttt ttcctagtgt 70620 

ttggccaggg cacacagtgc tctgtgtttt cctaggtgct tctgtgtatg gctttttgct 70680 

acagtgcttt aaagcatgtt gaaactcttt tatttcctct ttaggacaga tatttgccct 70740 

ctgcttcact gatagacttt aagctttgaa ttccttcctg aggatgtgga gaaagccatt 70800 

aggtctgcat ggagcttccc agggaggatt tggaggcagc ctcacccgcc tctagcattc 70860

ctgtctgctt aatcacacct cccttggctg cctcagtccc tgctctctca actccagggc 70920 

tcggcccttt ccctggtttg cctcttattc cttttaaagc agtggttttc aactagaagg 70980 

gattgcaaat ggcatttggc agtgtttaga gacagttttg attgttatgg ctgccagcat 71040 

ctagtaaagg ctaaacctac agtgcacagg accgcctcca cagtggagag acccaagtta 71100 

gctatgtgaa ggctgagaat ccctgctttg gagattaaaa aaggaagctg agggaaccac 71160 

tcagttggaa gcacccttgg tggcatgcac aaggccctgg ttctgtccct agctctgcac 71220 

aaaaaataga atacaaggaa gagtaaccct aatgagctgg tccctcaccc agtgtgccac 71280 

tgaggtcact tgaagggaag tctagcccca atttagtatt ttttgtggct gccatacctc 71340 

cagccttgat caaatctcat ggtatacatt ggtaagaaaa agggtttgaa acatagacct 71400 



gatactcgga catggaaaca gtatgtttgg tcagagagag cgaaggacct gatagacgag 71460 

ggcaatatca gagagagggc atcagtcggg ttagacacga gcattccaca gtgagcagct 71520 

ctggataagc ttttataaat gctggttaag gttttgaatt tgcccaattt tgtcaggatc 71580 

ccagagtcta tcacaaacat acacagtttt ctcaaatctg ctttgcagta tgcccgtgaa 71640 

tgtctcttat ctatactttc agatggtaag accctgaggg cagaggaact cagacccttt 71700 

gtgccccctg taagaccctg ggggatgcag tggacccgac tttgtgttct ctgcacagaa 71760 

aggagtccac tttcgttgag actaaggaag ggaactgaca agcttccctt tctggcttca 71820 

ggttggcagt gcctgaagct ctgagtgcca tctggaaggc agaaaggatc ctgttggtgc 71880 

actgtggccc tgagagtgag gaggtccggg agctccggga aatgaggtcc tgcttactgg 71940 

actcgtcatt cgtccctgtg gggcccttgg tgtagagcaa tcatcctcac cctcaagaag 72000 

gagctctggt gatgactgag atgttctgtt ggcttggagc tctcatcaga gaggacggga 72060 

ccttcccacc tgacctgagc ctagtgtctg gcacagagag cacttgaaaa cagattgaga 72120 

cactcacctg ccatgctggc tgctgcttgc aagagctaac tgccctctga tggaaacccc 72180 

atgcccagaa aagactaaat ccagtatcta aaggctgctt taaagggttg tcactgcagc 72240 

cgggcttggt ggcacacgcc tttaatccca gcactcggga ggcaggcgga tttctgagtt 72300 

caaggccagc ctggtctaca aagtgagttc taggacagcc agggctacag agaaaccctg 72360 

tcttgaaaaa ccaaaaaaat aaaaaataaa aataagtaaa aaataaataa ataaataaat 72420 

aaagggttgt cactgatctg caggcagctc atgctagcct aggcttttgg ctcgatttca 72480 

tctcactaaa cgatgaatct gtttccctgg aacattccta tggtttctag tagtaatgaa 72540 

gtgctgtgtt ccactccagt gagaacttca attcttagtc ttgtattata attgaaaaat 72600 

aatatatagc aagaaatcag tatgactgct tacctcaaga gacatacaat tccacttaca 72660 

atatcctgct tccttaaatt tttcattaag actggtgata tataatttgt gaatggagaa 72720 

ataaatacgt cttactgttg gcagtttctt cctgggatgg caactctgta ttggtttcct 72780 

accagtgtcc taattcttac tcagtggctt tcattgagtg ttcttggcac tcactgtcca 72840 

agcactgatg caaggcaacc ctgtagcatg acttcatagc acaggcctcc ttgttagcac 72900 

acctgaaagc agaccactct ggctgtttca cttgcagaca gaatcttact ctgtaagcca 72960 

gtctagcctc aaacaacatc ctcctgcctc agccttccaa gttctaggtt tataggaaaa 73020 

ggccaccttg cccagcttga gactgcttct tactgccatg tctcttcagg ctcacacatg 73080 

aagtccaggg cactccagga ggagccgtga gtctgtctgc agggcactcc agggggagcc 73140 

atgagtctgt ctgcagggca ctccaggagg agccgtgagt ctgtctgcag ggcactccag 73200 

gggaagccgt gagtctgtct gcagggcact ccagggggag ccatgagtct gtctgcaggg 73260 

cactccaggg ggagccatga gtctgtctgc aaggcattcc aagagcagcc atgggcgtca 73320 

ctcattggta gactgtgagg ctacatctcc agatgccccg agtgctgtgg ttgtgagcac 73380 

tgctgctcat ggtttccaac tgagacagag ggaaggactt tgcccctttc cctaaggatg 73440 

ggtagtaata gtccagacca caagggacag atagctatgg ggttttctga ctcatcctta 73500 

gtacattatt gctgatgacc agtttgtttg gatgagttag tgggaaagaa gacccaagtc 73560 

catacactct gctttttaga acttgctcat cctagccatg cccaaggagc agccgttgac 73620 

tgtcatggca ttacagtgag gaaataaaca gtcctgaagg tgcctggcag cagcttttca 73680 

agaagctggt gttaaaagac agtattcaaa catctgcgga ctgggaactg ggcagcattt 73740 

gagtctcctg ctgtctgtta atttaccctg acaaggaggt gacttgaaag gtttgttttg 73800 

tttggggtag agctttttca ggaaaaaagt ttagtcctac agacaactct atagttattc 73860 

tagtccaaac tcatgccttg tgttttattc ctaaaagccc tgtcacactt tgtaaaatag 73920 

gtgctcttcc tcaaaggata tatttaacgt tttatatatc aggccttatt ctgtgcatgg 73980 

aagctttttt tagatgcttt gtaagatggc tcagtggtta agagcatgta ctgctcttct 74040 

ggaagtcctg ggtttgattc tcagcagcta acaccagctg ttattccagt tcctgggatc 74100 

tgatgccctc ttctggccta tgtgagcact gcatgtgcgt agtgcacaga caaatgcagg 74160 

caaagcactc atacataaaa ctaaattcaa aaaactcttt cattgtctca tgtgacctag 74220 

cttgagaata cctgtgctta tattataatc tagtatgagc cagccacggt agcaacacac 74280 

ctattatctc agcactcaga agattgagac tagatggtca agagctagag tctgggttac 74340 

aaaacacctg tctcaaaagt aaaagggctg aaaaagtgtc tcagcagcta agagcacaca 74400 

ctgcttctcc agagggcctc atttcagttc ctaataccca caccgagtga ctcaaccacc 74460 

tgtaactcca ggtccatgag atccaacacc tctggtcgtc tgcataagct cctacactca 74520 

attatacaga gagagagaga gagagagaga gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 74580 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 74640

nnnnnnnnnn ntctagagtg tttcaggttt tttttgtttt tttttttttt gagacaaggt 74700 

ctctctatta tgctgcctgg aactttctat gtagaccagg ctggactcaa acttatagtg 74760 

atccactact tctgcctctc agtactggta ttgaaggcat gtgtcaccac accccactac 74820 

ttcaagatct tagatttcca aagaagccgt agcctagaaa aggttaataa gtactgattt 74880 

aaaacagaaa gaaatcaggt acacttagag ctgtagaatg tcagcatgtg acatttgtga 74940 

caagttgtca aaactttgct cttaattcta aagagagaag ctgtcaaaag acttgaactg 75000 

gggctgtagc caacttggtc gagcccttgc atgaagctgt gtgtttactc cccagcactg 75060 

tggggtttga attgatttga acccagtaga ttcgtatatt tgaatgttta cctcatgggg 75120 

aatgacatat tacaaggtgt ggccttgttg gaggaattgt caatttgggg gtgagctttg 75180 

aggtctctct gctcaagctc tgcccagggt agaaagggag cctcctcctg gctgtctaca 75240 
gaggacatag tctcctggct gccttcagat caagatgtag aactcttggc tcctccagca 75300

ccaagtctgc ctgcacaatg ccatgcttcc taccatgatg ataatgaact gaacctctga 75360 

aactgtaagc cagccccaat taaatgtttg tctttataag agttgccttg gtcatggtgt 75420 

ctcttcataa caataaaagc ctaactaaaa cacattcctg ctgggcagtg gtggtgcacg 75480 

cctttaatcc cagcacttgg gaggcagagg caggaggatt tctgagttcg aggccagcct 75540 

ggtctacaga gtgagttcca gaacagccag ggctacacag agaaaccctg tctcaaaaaa 75600 

aaaacaaaaa caaacaagca aacaaatgcc agcatttggg aggtagagtt aagaagattg 75660 

ggagtacaaa gtcgtctcag ctagtatgtt tgaggccagc atggaccaca tgagacgttc 75720 

tcaaaacgaa agaaacgaat gaatagataa acatttgagt gtccagtttt ttcctttctt 75780 

tcttgctttg tttttggcgg tgctgaggat taaacccagg accttgttca tactaggcaa 75840 

gcattctcca ctgaggaaca ccctggcgag tgcctagtct gtctgtctgc ctgcctgcct 75900 

gcctgcctgc ctgcttgtta tgtgtatgag tggtaacctg catgtctgtc tgtataccac 75960 

agacatgcct ggtatctgca gaggccagaa gaggatgttg gatcgcctgg aactgggatt 76020 

acaaatggtt gtaagctgcc atgtaggtat tcagaattga acctggtgct ctgaaagagc 76080 

agccagtgct cttgttgttg gttttattgg gggcaggagg tagttatttg gttggttggt 76140 

tggttggttg gttggttggt tttcttgaga cagggtttct ctatgtagcc ttggctgtcc 76200 

tggaacttgc tctgtaggct caaactcaga gatctgcctg cccctgcctc ccgagtgctg 76260 

ggataaagtc atgtgccacc aactccagac aagcagccag tactcttaac cactgagcca 76320 

tcattccagc ccttctttgt gttttgagat ggtcacaaag tacaactcag actgagctct 76380 

tgatcaccct ccctcagcct cctgactgct gggggttaca ggtgtgtcac tgtcctcaat 76440 

tctgagtgtc agatcttgaa aacccattct cgtgaccttg atccttaaaa caaaccctgg 76500 

gagaatgagt tctgataact atttctcact cctcttcaag aaaaggaaag ccagagaaag 76560 

gggggggggg aagccccaga aacattgata acttgcccaa agttacacag caaaattcag 76620

acagcctgca catctcagtg gccatctgtg ccatatccac cctgcccttc tctgacctcc 76680 

ccacctccat ccctacagac cttgcagttg agatcagagt ccaagccgta tcgtaagatg 76740 

gccttaggat ctgacatcat ggggactctc acggtcctgt cctcgtccaa atgaaaatcc 76800 

tggagggtcg tctttctcga gtcaaacttg gttacccact gccctggaag gaaacagatg 76860 

gagcatcctg agccactgtc cccagaaagg ccacaggtcc acgctgtgcg tccactggcc 76920 

aaagcaacct gagctgtcag cagcaagaac acaggagccg ctgggtccca gcatgtgtgg 76980 

cacagaccat aggctccatg caccacgggt tctggctatc ctcctgtagt aaactcagaa 77040 

ataagtgggt gttctctctc tgacttggat caccacgctc cttctgttta aagtggcctt 77100 

taatatgctg gtgtgtggta cgtgcctgct ctcctgtccc ctggggactt ggagtaggaa 77160 

gccccagggc tttcctctaa gttaggatcc actcttgcta ctactccata agatggtcac 77220 

aaagcaacgt aaaatggaaa ttaatcaaac cattcctgcc acaagaataa aacagatctc 77280 

aggggaggcc tgtggaaggg tctcctgagg ccttaccact gtctagaagg aagttgacag 77340 

cagttcttga gcaggggtgc gactccagga gttgggggct gctctgagag caggacagca 77400 

tgtattgtag agtgtctggg agggagctgt gttatcctta ccgtgaagat gaggacacgg 77460 

gctcatgggg gcagagccag gattaaacct ggtctgattc aaaaagccag agatctgtgc 77520 

ccagccccac gcagccattt cactggtcaa ctaattcaga aacacttggt ctgatatgct 77580 

catatgctac aagcactgtg gccttcagat ctccctctgg cctggtacct gcattcaggt 77640 

tcaccaccat caccacacac acacacacac acacacacac acactcggcc agagacaagt 77700 

ggggaagccc tcacccttga agtaagccac gccaaggaga aggatgctga gggcactggg 77760 

catttccctc gtggaccggg caatcttccc tttcatctgg gcctgcaccc agttgttaat 77820 

ctcctgaagg tctactcgag ggttgcccgt gaggatccgg ggcctggtcc cataggactt 77880 

ctccagaggg gcaacaaagc tggatttgac tcgaagttct tgagaggaaa cagatcaaag 77940 

atgagagctg aatcagcacc ctcactttga aagcatgcca gaccccagct tcctgctcag 78000 

catcttcctt tgacttgctg gggcatctgc cggcttgccc agaccctggc tagggaacag 78060 

tggattccac cgtttgcatt ccccgtccca ggccctcctg ctgtctccca gagcccactt 78120 

cctctttctg ttcctctgtg gtctcactgg ctctttcctg cccaccagtg ccaggcctcg 78180 

cctgagcaca cacagcctat tgtttagaca tcatggaagc atacagacaa cccaggccaa 78240 

tgaagcaact tcacgccagg cataatgggg cgtgcctgcc cttcagaagc agaggcagct 78300 

ttatgagttg ggggaccagc tgagactcta tagactttga gaggtggggt gggggtgggg 78360 

ctactgactc ctctcaaaca caattctgga agcactcttg aggttcttct caggggcagt 78420 

aacagaggca aggagctcct tgtaggtgct gtggatgtca gggttggtga tcaggtcgta 78480 

gtagagagcc cggtgaatga cagactctgt tcgatgttca gctcctgcca gagagaaaag 78540

gatgccaagc ttcataactg cccgtgaggc ccgcatcagg atagggacgt tagacatcaa 78600 

tccttttgtc ctctgagagc ccgaggaggc cgatattgca gatgttttag ctggacaaga 78660 

tcttcagggc gtggaaagaa ataatgaccg ccttgctagg aagagctcta agacagggca 78720 

aggttatcag agctacagag agaagagtgg gatgtggtcc tgaagttctc ccatcgtaac 78780 

ctaccctgtt ctgaggagga gccagctctg ctcacggcag ctgtacccct agaacctggt 78840 

taaatgacta aaacacgata ggaggccact taaggaacca aggtcgagtg ccacttacaa 78900 

agtggtaggg attgtgtgtg tggcccccac cgcccctttc ctgttcctct gacggcggca 78960 

gcatggaaac tctgagtggg ggaaattcag gtccacctgc agccttcttc agttgacact 79020 

cacccagaga aagggcagag agggccgtgg ccacgctgag tggagacagc aggacgttgc 79080 
ccgttgggct ggcactggat ctcaggcggt acagatcgta gccgaagttg gagacagctg 79140

ctgccagctt gttcacaggg accttgaaga aggggtcctc ctcctccacg ggctcgcccg 79200 

tgctgtccgg gactggggag ccctgggtta gaatacaagg accagtaggg aggcacagtg 79260 

agtacatcac ctcctggttg ggttggtcct ctagtccctg gggccatgag tctgaggtca 79320 

gaatgagtgt gtgctctctg actccacaac ctgtgtgctg ggaggtgggg agtgggaagg 79380 

gcaacacaaa agggcttgcc agacctgaac tgtggtctga gaacctgaag cctggcccac 79440 

tttaaaataa aacttgtagg gctggggaga tagcacagta gataaagtac cagcatgcaa 79500 

gttcaaggac ctgggttcag tccccagagc tgggcacggg ggtgcatgct tataatccca 79560 

acactgggga ggcagagatg ggcaggtcct ggggctcatt ggccaatcag cctgaactaa 79620 

tcagcgtatt ccatctcagt gaggggtcct gtttcagagg gcctgaggaa tgactctggg 79680 

ttgactacta gcctactctg tgtctgtttc tgtctgtctg tctgtctgtc tgtctgtctc 79740 

cacccctctc tgtccccttc cctctgcagg gaacttcctc accaccacca acccccaaag 79800 

aaacccaccc tcagaccagt cttccctatt cagcttgctg gctggtccta gtctgcctag 79860 

gttctgctgt gacgccctcc ctgtctttcc tgacaagcca tcccctctga ctagacccga 79920 

gaggaatttg tcgttttctg acctgttttc agtgtcagcc tcttccttat gagactttct 79980 

gcttttttgt tttgttccca gggctttgga tcaaagctgg gctcttacat acgttaggca 80040 

aatgcttggc cacccagctg tacctcccgt ccctgttgct tttcggtttg gaggactttt 80100 

tttttagttt tctgtttggt ttttggtttt gattttttgt tgtttgtttg tttgttttga 80160 

gacaggattt cgctatgtga ctctagctgt cctgggactc actatgtaga ccaggctggc 80220 

cttagattca gagatccacc tgcctctgcc tcctaagtgc tgggattttt agattttaat 80280 

ctgtacctac caacctcaaa ggaagtgtcc atggatagag ttcagtacta catcatgtgt 80340 

gtaacatgtg tgagggcctg ggcttcaccc ccaacagaga agggggagtt agtaggtgag 80400 

gaataagtga ctggctagtg gcaagacagt attgtctaag gtcactaagc cttaagccac 80460 

acttaaagcc cacaatccag gtctaatatg cccatctgcc ttgtccttgt gtgacatgac 80520 

cccaccccta cttcctccgt atagtggcag ctcctctgga tcctgaaagg agagggaaga 80580 

tattcttgtc tcgatgttaa agtaaccaag gcctagaaga gtgaaggcca aagcccaccc 80640 

tggatccagg gctgcctccc tgcactgtct cttctgctgt cccacctacc caccctactg 80700

acctcagagc tgctggggac gttctggctg ctgccgtgcc cgagcagggc tccagtccag 80760 

aggagtagca ccagggcctg catcccggaa ctacaagaga aacaagagag caaacgactc 80820 

ccctcaccca caccctcccc tgccactgca cattgcacac tgcacaggga caagagtcag 80880 

gcagagtgag cccttcccct ccctccagct ctcagcccca agtggaccct tgacttgagg 80940 

tcttccgtcc ctgacctgcc cctgcacttc tccttgagct gtgcccccat gttggttcct 81000 

atcaggagac ccaccttccc atctaagctc cagcacaggg aagacccagc agcaggctct 81060 

tcaggcccca agacaatgct ctagcacaaa cacacaccaa ggcttttccg tggaggcaca 81120 

cggccagctc ctttggtagg atttggaacc ctgtctcagg atgggagcag agcccaggtc 81180 

atagacttac agaacatctg gtctggtcct gctatccacc aatagttctc tgaccaaagc 81240 

ctatgttaaa gacacacaca cactttcttc taggtaggtt cttgtgtatg tagcccaggc 81300 

tagccttgaa gttgcagcta ccctacttca tttgcctcct gagtactaca atgccaggtg 81360 

tgagccatca tgcctgactt gactcccttc ttctatttca agagcaattc ttagttaaga 81420 

gggtatgaac cagggccaca ctgccnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 81480

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 81540 

nnnnntctta tttttagctt gtttgttttt cttacttgag acctggggag gggaggtgta 81600 

tgtgtcttcc atttgcttct tctacctaat aaagtttctt ggtgatgttt gggggggggg 81660 

gagggggtag gactcaagaa gggattctcc cataagctgt tccgtttggg tagtactatg 81720 

taaggaagtt acaggtgggc agagctcggc tctgcctgac tggcgtgctc tgaggtaaag 81780 

gtgagatggt gcaagatttg ggccctcagg agttggctct gttggccctg taccttctgg 81840 

tctgtgggta aggatgacca gtaggtgaga gatgagggaa ccagaacaga aggtgaaagt 81900 

tagtggggcg gagccccaga ctagtcaggt gggggtaaac tagatgactt tctggaaccc 81960 

caaggggctc ggagactagt ggtgttggag aagacctcta atgtgttgta aggcctctga 82020 

actcagtagc cgaacttgat gccagaaagc cccaaactgc taaacccaag caggagcggg 82080 

acgccatccg ttccatggct tcacccgagg tggccccatg gctgcgccaa tcaatgagca 82140 

gccgagagat aggggcgtgg acaagccagg aaaagttaca gcacgctgga aagataatac 82200

aggccaggaa gccccaggca cagcagggtg gaaaagctag atcccgattc tgccggaggg 82260 

gggccccttc gaggtcccgg gcaccgggtg ccaggatcag agaaactgac tgaaacctag 82320 

ctgacctgcc cagaccatgg catcctgggg actccttgtg gctggcgctt ccttcacggc 82380 

gtttcgggga ctgcactggg ggctgcagct gctgcccacc ccgaaatctg ttcgggaccg 82440 

ctggatgtgg cggaacattt tcgtttcgct gatacacagc ctactctctg gagtaggggc 82500 

gctggtcggg tgcggaactt ggggactgac aaagcactga ggggcggggg tggaaaagag 82560 

ggcctggaag actgaagttg gaaccttttg gaatggaact ggtttgggtt gtggatgggt 82620 

gggagtaccc agtgggagaa tggatctagg tctgggagaa attgacctta gctctttgtc 82680 

ttctccaggc tgtggcagtt tcctcaaatg gtcaccgacc caattaatga tcacccaccg 82740 

tgggcacggg tcctagtagc agtgtcagtg ggtgagtgta cagaaaaggc tgaatcggga 82800 

aaggccttgt tggaccggga attctaggtt cctcccccat ctttggaatg gagcagatgt 82860 

tgctggaggt ttgctgtgag gaattaagga cctgagaaaa gtgggacttg agatatctag 82920 
gctgtgcatc agctctgagc gaggagcctc atagtcttct ccggtgcctt caggttattt 82980

cgctgcagat ggagttgata tgctgtggaa ccagacattg gcccaggcct gggaccttct 83040 

ctgtcaccat ttggcggtaa gactctgaag ggagaggcca ggtagtaagg gagcatgtcc 83100 

aactcaaggg cccaacctct ctcttcagtg ttctgtcctc tgacttttcc acaaagcccc 83160 

ctgaaaacct atcctctcag acttggattg agttggaggg aggttttgac tggctagcca 83220 

ctcctgggca ctgcccaagg agtttggttc tccccacaaa cctccagctg atcataaaaa 83280 

aaaaaaaaaa aaagccagga atgaaagcta gggtatgcta tgcaaatagt gtggcttggg 83340 

gtaagagaac ctctggtcca gggctgctca tgccccctag ataagggtca gcagaaaggt 83400 

caggattgga ggcagtccta aaaaatgctt gggtaatata aagtgaataa ataaaaaata 83460 

aataaatact aatttttaaa aagctgatac ctggaaggat gaggcagaga gtagaaaaaa 83520 

catgcgtggg tgtccctagg ataaggagct gggacttgtt gggcacaggt catgcaaagc 83580 

ctgaaccttg aaccttgcct gcaggtagtg agctgcctca gcaccgctgt tgtgtctggc 83640 

cactatgtgg gcttctctat ggtatccctg cttctggagc tgaactccat ctgtttgcat 83700 

ctacggaagc tactgctgct ctcccataag gccccatcct tggccttcag agtaagcagt 83760 

tgggccagcc tggccaccct ggtcctcttc cgccttctgc ctctgggatg gatgagtctg 83820 

tggttgtccc ggcagcacta ccagctgtct cttgctctgg ttctgctttg tgtggctggg 83880 

ctggtcaccg tgggcagcat aagcatctcc acagggatcc gaattctgac caaggatatc 83940 

ttgcagtctc agccctaccc gtttatcctc atgcacaagg aaaccaagac acgtgagcct 84000 

gttgccagga acacttccac tctcagtctg aaaggtgtgg aagttttctc ttctgtcagc 84060 

ccccagggag gtggggctgg gaagaggaga tggtagccca ctgcatagtc tactatgtag 84120 

caaggactag actgtatcat cagagagaga gagagagaga gagagagaga gagagagaga 84180 

gagagagaga gaacattgta tgagatctcc attacagtca ggaaatcagg agatctaaat 84240 

aactttaaaa gtcccacagt ctttacatat tcttaaaatt tcaatctctt taaaatatcc 84300 

atctctttta aaattcaaag tctttttaca attaaaagtc tcaactgtgg gctccactaa 84360 

aacagtttct tccttcaaga gggaaaatat cagggcacag tcacaatcaa aagcaaaagt 84420 

caatctccaa ccgtccaatg tctgggatac aactcacgat cttctgggct cctccaaggg 84480 

cttgggtcac ttctccagcc aggccctttg tagcacacgc gtcatcctct aggctccaga 84540

tacctgtact ccactgctgc tgctgctctt ggtggtcatc tcatggtact ggcatctcca 84600 

aaacgctgca tgaccccttc agtcctgggc cttcaagaga gaagactaga gcctggcaaa 84660 

gtggcacatg ctgataatgc tagcacttgg gaatgacaag cagaaggatc agaagttcaa 84720 

ggccagcctg ggctacaaga gactctgttt caacaaacaa acaaacaaac aaaccaaaga 84780 

agagaaagaa aaaactggac atgacagccg gaacattatc tgacattcat aaggtcctga 84840 

gttcaatgcc aagttggcag tgcctagttt gataagggtc tagccactct ggtaatacca 84900 

tggctgactg aacaccttac ccagcaactt gctgatagac tctgccttcc agcaaaaggg 84960 

aggagcttcg ctgaggagag aacattgaac cctattgtat atgaataaat tgctgtgcaa 85020 

atgatttcat cagtctcttg tgaatgtgat tgctttgagt catttttctt ggctccagtg 85080 

ttatcctggt ctgcagtgtg gtgtggagtt gtggaagctt tgagttggga gggtttcctg 85140 

ttaaggtttc tctggctctt ttctttcctc ccggtttttg ttttgtttgc ctggtggggt 85200 

tctctggtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tagaagttgg 85260 

cggggggtgg agggggctgg agagatggct cagcggttaa gagcgccaac tgctcttcca 85320 

aaggtcctga gttcaaatcc caacaaccac atggtggctc acaaccatcc gtaacaaaaa 85380 

aatctgatgc cctcttctgg agtgtctgaa aacagctaca gtgtgcttac atataataaa 85440 

taaataaata ttaaaaaaaa aagaagttgg catggatgat gtagtgaaga ctggcattag 85500 

atatctctgg atccccctgc ctctacctct tagacactgt gagtatggaa gtgtaccacc 85560 

gcaccaggcc aggctagaac attctctgat ctacaaatac ctagagtatt attcctctat 85620 

gatcagaaaa cagacccagg gggccacaga aatgtcttag taggtaaaaa cacttgcttt 85680 

caggcctgat aacctgcggt tttttgtttg ttctgggggg cgggagaggc tggctggctg 85740 

gctggcctgg aattcacaga gatccacctg cctctgcctc ctgagtgtca ggtaccagga 85800 

tcacaggtgt gtgccaccac acttggccta actgcctgag tttgagcatc agtactcaca 85860 

tggtactgag gatagaatag actctcacca gctcttctga cttccacatg tgccctgcag 85920 

catgggctct ccttccccaa aggaaaaata aatgtaagaa ttaaaaaaaa aaaaaaaaag 85980 

caaacccagg tcttgtgtga tggctcagca tcaaagctac ctcccgccac agctgaccac 86040 

ctggtgataa cttatagcct tgttatgctc tcctttgacc tccacgggca tgctgtacac 86100 

gtatgtgtgc ccacacaaac acacaatcaa gaaataaatg cagccaggcg aggtggcaca 86160 

cccctttaat cccagcactt gggaggcaga ggcaggtgga attctgagtt cgaggccaac 86220 

ctggtctaca aagtgagttc caggacagcc agagctacac agagaaaccc tgtttcgaaa 86280 

taaccaaaaa aaaaccactt taaatattat ttttattttg ttttgtttat cctggaactt 86340 

ggtctgcaga ccaggctggc cttgaactca cagagatcca actgcttctg cttcccaagc 86400 

acattaaagg atgtcccacc actgcctggc taaagattta ttttttcttt ctttttgttt 86460 

tgttttgttt tgttttttct aaaaaatttt tttaaaaaga accatccctc ctagcactca 86520 

ggagactctg aagtcagggc cagccaggtc tactgagtga gctctagggc agccagggct 86580 

ccacaaagaa accctctctc aacaaacaaa caaaagagaa cagacccaac cagacctgag 86640 

gacacacact tgtaatctaa gcccttgaga ggctgagaag ttcaaggcta gccacaagtg 86700 

tgtggtgcat tcaagagcag cctgggtggg ctacagaaaa agaaagaggg agagagagaa 86760 
tggttaatga agatgactct ggaaaagtga aactcaagag aaagcccctc agatttgctt 86820

aagacgagtt gagggtggag aaccgccaaa gcggacgagc cagacagaga ctgccaacaa 86880 

agttcaatcg gttcaggtac attacttcca aaacgccatt gccacatcag gatgcttcaa 86940 

tcagccaaac caacgcagcg actattgact tctgcatttc agagacttcc gtctctgtcc 87000 

agggcaatgt cactttagct ttcctttgca gaaaggaaaa gtccctgcct ctgatgtggt 87060 

agatcctcac acaccttctg ccagatccag acactggtat gactcagcct cggggagctc 87120 

tatctacaga gataagggta caaggcgtgt gtgtttaaag tatgtgttta aaagtacaaa 87180 

gtgagagtcc ctggaaaggg ctccctgccc tcaccatcac cgaaagcaca aaccttaggg 87240 

taatatctga cattcctgga aatgtatgta tgtattcatt atgtagccct gactgtcctg 87300 

gaatggggta taaaccagga tggcttcaca tctcagagac ccatttgcct ctgcctccca 87360 

agaactaaga ttagaggcat gcactaccat acttggctca tgatttactt aactttattt 87420 

tatgttcacg aatgttagcc tgcatgtatg tgtgtgcacc atgtgcatgc ctggtgcccc 87480 

agaggccaga agaaggtgtt ggttggattt cctggagatg aagtcccaaa caactgtaag 87540 

cagtccaatg tgtgtgctgg agatgaaact tggttcatcc acaagagcag tatgtgctct 87600 

taactgtgga ggcatatctc cagcctcaga tttcccagtt aatgtttgct ttcgcaccca 87660 

ggcccatctg cgcatgcgct ggagacctcc tttaccgcct tgagcctcat tggccaattg 87720 

tggctgggag acttgcagat cccaagtggt acaagagaag aataaactgg tgtgctatga 87780 

actcacctct tctctgtagc cattggctga gcatactttg cctcaaccta ccgcccttcc 87840 

ttcccctaat cctaaatctt tgccctctcc aaatgtgctc ctcccccgca gtaatccagt 87900 

ggtcgctggg gctctagaga gatggggggg gggggagcaa cgggtacagc ttaaggcagc 87960 

tgcagcagaa cttttttgct gtatattgag tcttaaaaat tcatataaac tttgtgttct 88020 

gtttctaaat ataaccccat ctgtttcaac acaaaatgca acaacaaaat gtttcaaatt 88080 

gctatttgga ataattaaaa aatttcaata cttgatttaa aaatgcttta actttttaaa 88140 

taaattttaa atgttattat ttttaaaaag ttacaagttt aaaaaaaaga aagatagaaa 88200 

tcacataatg aaattaacca tacgcaagtg aggctcggtg cactggtaca cagttacagt 88260 

agccatttgg agtggaggcc atggcgcttc acattgaatt ttatactttc tttatagaat 88320 

attttttatg cacctatcta ctactgataa caaaacaccc atgagagagt tagaattaga 88380 

catcaattag ctttgatcct ctgtcataac tcgtgtccac tccctgcctt agtcctacct 88440 

catccctgtc ctcttttcta catcttatac tgaatccaca cactcagttg tttacacaaa 88500 

cacatacatc actgtccann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88560 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag 88620 

tgcctggctc tgttttgctg ttgttgtttt tgtttgtgtg tgtgtttaga tttggtttgg 88680 

tttggtttga ttttgttttt ttttagagaa tcttactatg tagctcaggc tgtccttgaa 88740

ctcacagaga tctcttgtct ctgccttcca agtgctgaga ttaaaggtat acaccacctt 88800 

acctggcccc tttcatctat ctatctatct atctatctat ctatctatct atctatcatc 88860 

tatctatcta tctaaaattt atctgtgtgt gtctgtgtgt acattcccca gagcctgtgt 88920 

ggtagtcaat aagtaaccct cagaagttgg ctttctctaa tccttggatc aaacttgaat 88980 

tgttaggctt ggtagcaagc atgtttaccc actgagccat ttatgacccc atggcccagc 89040 

atcttccatg ggttctgggg acacaaatgt gtactttgat gtttacagga caagcgctta 89100 

accaaccaag tcattttccc agccccatcc tgactcccat taagtgttct ttcccccaac 89160 

ccaggaccaa atctagagga gtgtccatgc ccaacaaaca ctctgccaag cctctcccct 89220 

tactgctctt ctcccttccc ttccttcatt tcttcgttcc ttcttttctt tctttttgaa 89280 

acaggtcttt tctctgcatc ccaagctagc cttgaacttg tgatgtagct caggctggct 89340 

ttgaactcac agctgtcctc ctacttcagc ttcccaaaca ctgggattat agacctatgc 89400 

taccacacct ggctcatttt tcaaataaat aaaaagaaaa tcaaaaagtt cctagaacag 89460 

tcacaggatt cacaaaaact ttggaaggag actaaaaatg gatttttaaa aaatgcttga 89520 

agcacaaaga gttgttgaaa gaagagagaa gaggaaaagt tagcttagta ggtagaagtc 89580 

aatcaagcct cacaccctga gttcaattcc tgaccctatg gtagaaggag aagagcaatg 89640 

ccggaaacat tatcctctga cctccagacc cgctgtggca cgtgcatgca cacacacaag 89700 

caggagccct tggagggaag tcctagaaat gaatcttact gaagcaggtc tgccaggccc 89760 

tgtgctcagc cattttattt ttcctttgtg tacccgacac gcttccattc tcaaagttgt 89820 

gagtctgaga ggaagtactc actgtgtccc cagtgagctt ctgtcttacc ctgggtcact 89880 

tagatggggt cacttagtgg tagccttggt gtggagaaag agaacacagg tccgagtagc 89940 

cagtagacct gagtctttat atctgcaaag ggtgttgggg cataatcaaa tctccccccc 90000 

tccccggggt cctgatacca ggttgtatta agtgtatgtg catggtcctt ccaagtcttg 90060 

acacatcatc caactccaag tggctttctc atttttcctt gccagtagcc tcttggtgag 90120 

gaaatggctg aggaaaacag agttgcagaa agacagggcc atggcctggc tgcaggcttt 90180 

ctctgagtct gaagagggtc agcgactctg agaaatgaag ctatttctga gtgagagggg 90240 

ccaaagaagg aacacggcag agggagagcc cccgaggaga tggagacaga agccggagag 90300 

ggaccctgtg cgaggctgga ggggaggaag aggggggagg agtgagaccc actgtcatct 90360 

gttgggcaga gaggggctac attcatctgc agtatggtgt agaggggaca gagagtgatg 90420 

gtaacaggaa aaatttgggg ttgagggggg cagcctgtag ggctgggccc cagcagtgta 90480 

cagctaggta gagtacacag taactcccag aattctctgg ctccactaaa tccctgttcc 90540 

gctccgtgca gagtaaaacc cacacagggt ggatttcagt ctcctttgca cccccctcca 90600 
ccccccctcc
gtaaaagagg 
gctccgtgac 
agatgatggg 
acaccacgga 
agctgctgga 
tcccaataaa 
gatctgccac 
aaagcctcat 
cctgatgata 
ttatgctgat 
atggggctgg 
cccacacttg 
tgtgtctctg 
tcaatggctg 
ttcagtgcct 
gtctgtcctt 
ttttttttaa 
gagggagtca 
tccagacctt 
acatcattgt 
acccccaggc 
tttcttcttc 
catgtctcgc 
gcaggggctc 
tggaaagcac 
catgccattg 
caccctcctg 
gacgggggcg 
tctcgtagag 
agatctcctt 
ctcctctggg 
ctagaagccc 
gggaggagga 
tcacaaagag 
gatgtttatg 
gatttatata 
gaacacagac 
aggaagcaga 
caatggccca 
atccgatgag 
actaaagtgt 
aagtggttgt 
ccagcctgga 
gggagggcaa 
tagtgagtta 
tcccaacaac 
gtgtttaaag 
ggccagagca 
atctgtacaa 
aagaaagagg 
acagagatga 
ggggggctac 
aaaatacata 
agtcccagca 
agctgtatgc 
agacagacag 
ttgctattca 
ccaactccac 
tgttgtgcac 
agaaaagaaa 
ccaggtcctt 
ctttcccaac 
ccttcctggc 



acccccagct 
ctggtgagga 
actttaaatg 
tcagggacct 
gacccctgtg 
gcgcaagcct 
cccctttccg 
cttatcacac 
tctctgctta 
acaacccttg 
agaaatgtac 
ctggggcttt 
gttgcacaag 
tgtgtgtaaa 
cacaattgtc 
ttcctctgtg 
gggttgtctg 
agatttattt 
gatctcgtta 
cggaagagca 
cttgcccacg 
accgggacac 
caagcaggtc 
tgcctctctt 
taagctgcct
tgtgacccaa 
atgtgcacgc 
gtgcttagga 
ggccacagca 
gtaaggggcc 
tgtccccata 
gcagcctcaa 
tcttacgtta 
aaggaaaagg 
ccctcttttc 
tattttttta 
ggaagtgcag 
ttgtaaaggg 
aggtaaaaaa 
ggcggaactc 
ccagagtgtg 
gcactggcct 
tattctagta 
ctgcatagca 
catttggaat 
gttcagcggt 
cacatggtgg 
tcagctacaa 
agtagaggtc 
ttacagtgca 
gtggggcagg 
gcttcatgtg 
agtgagagta 
ngcnaaaaaa 
cttagaggac 
agtcctgatc 
agagaaacac 
gaccccatga 
agagctgttc 
acatgcacag 
ggctggcttc 
cccagtccag 
gcagtgtcca 
tccctcacca 



ctggtcacag 
catcggaaag 
cctcagataa 
cagcgcctct 
accccctcgt 
caaggcctgt 
cccaggatta 
acatacaaaa 
acccaagtga 
caatccaata 
tctgcatgtg 
gcacatcatt 
gcagaatgac 
acgcctctct 
ctttctcttt 
ggtttgtcct 
gctagaacac 
atatgtaagt 
ggatggttgt 
gtcgggtgct 
actgctctcc 
aacatcttct 
tcccaggaaa 
ttcgggaacc 
gggcaaagga
gcacattttg 
tgccacacag 
actaacggct 
gcgcaggctg 
cactgaaatg 
cctcagcccc 
gcccagcacc 
ggggatgaca 
gaggggagag 
tggctttttg 
gaacccgtat 
tgagttaagg 
ttttgtaaca 
aaaaaaaaat 
ctgcttgaaa 
gccatagctg 
ttccttgaca 
cttaggaggc 
acacccagtt 
gtaaataaat 
taagagcgct 
ctcacaacca 
tgtacttaca 
ctgagtttaa 
ctcatataca 
ggagggaaga 
gaaacacagg 
tcatgagttc 
aaaaaaaaaa 
tgaggcaggg 
ctatcccctc 
aaagaaaggg 
cctgagctca 
tatgatctct 
attagaaaaa 
ttccacgtca 
tgggggctga 
gcaaactcaa 
atgcgggggc 



ccagtcagag 
tctgtaccct 
aacagtgaga 
gaggctcaga 
gcagagggag 
gctagttatg 
gtggacacgc 
atcccttgag 
cacctatatg 
gaggggaact 
gggagcctgc 
gggactcaga 
aggatgttat 
ctggagccct 
ccaaggacct 
gctagccccc 
agacatcatt 
acactgtagc 
gagccaccat 
cttactcact 
agaatggtgg 
acacgtgtag 
tggcacttac 
ccccagaggg 
gcagggggta 
cagcagtaat 
aaaccagtga
ctaatgagaa 
gcactgcgtg 
tcacttaaat 
acgcttcttt 
cactttttag 
ggaggtagag 
agggaaagag 
actgcactgt 
ttattaacag 
ggggcaatta 
tccaatcaaa 
tgtcccatta 
gaaggtgaga 
ggtcatgagg 
aaggatgcac 
tgaggcagga 
tcaaaataac 
aaaacatttt 
gactgctctt 
tccataagga 
tataataata 
ttcccagcaa 
taaaataaat 
agaagaaagg 
cctgtagtcc 
aaggtcaact 
acatagccag 
cagaaagaaa 
cccccccccc 
gccttcagat 
aagcctggga 
atatgaatgc 
gaggaggaag 
gtgtgagagg 
actgaggcag 
ctctacagcc 
attggctccc 



ttgggggtgg 

ccactagcaa 
gactctcctg 
caccaggata 
aatgccaatg 
agtctactgc 
cttgctcaag 
gatggttacc 
gcagatccct 
cgggtttctg 
cttgctcacc 
gatgttgact 
gcctggtgtg 
cctgtctgtc 
ctgtatgggt 
tgtcactgag 
gtcttttttt 
tgtcttcaga 
gtggttgctg 
gagccatctc 
gcaggaggat 
gtcttgtgca 
agagattgaa 
agcagcagaa 
gcatggagcc 
gtcaaattct 
cacaaaggca 
atgagagctg 
ttggaggagg 
tagccaccac 
ctttttttct 
agctgtaaac 
atcaggaagg 
atcgagagag 
gagttattta 
cctgaaagga 
agagagcaga 
ggtgcttcag 
gaagctgaca 
agggagggac 
cccagggttg 
ctatagctag 
ggatcaccat
aacaaaagga 
ttttaaaaaa 
ccgaaggttc 
gatctacgcc 
aataaattct 
ccacatgatg 
aaataaatct 
taagaagcta 
tggcactcag 
tgggtgagac 
gcatgatggt 
aggaattcaa 
ccagagacag 
ggctcagcaa 
cccaaggtag 
tggggcatgt 
aaaaacataa 
agggtctggc 
cggaggaggc 
tgtcctgatc 
aggctcctgg 



ggcagatctt 

agtgccagac 
gtggcaggca 
aagaataaaa 
tggcccagct 
tgctgccttg 
ccgagtccct 
atcctgggac 
gtgtccttct 
tcagcttcct 
ctgagacccc 
acatgaacgt 
tgagtgtgtg 
tgcctcttgt 
gtgtccttca 
aaagtcttct 
tttttttttt 
cactccagaa 
ggatttgaac 
accagccccg 
gtgacccccc 
ctggctttgc 
gagtttaata 
accagggctg 
ttagccaatt 
gccgttcagg 
cagccttctc 
aaaggagaga 
ctgacccact 
tcccaacact 
tctttttctt 
caccctggtc 
agggagggag 
catgcattca 
gccaacaata 
gagagacgga 
aagagatacg 
gtattttcca 
ctggatggag 
acagaccagg 
gaaggacccc 
gcgtggtggc 
gagtgtatgc 
agtgggggtg 
agaaaggggc 
tgagttcaaa 
ctcttctggt 
ggagtgaggg 
gctcacaacc 
ttaaaaaaag 
aataaaaggc 
gtggggttgg 
cttgtctcaa 
atacatttat 
gatcaggctg 
acagacagac 
ttaaaggcgc 
aaggcaagag 
gcctacacta 
gattgtttca 
ccctttgtag 
aataacggag 
cacagagaag 
gccccccccc 



90660 

90720 

90780 

90840 

90900 

90960 

91020 

91080 

91140 

91200 

91260 

91320 

91380 

91440 

91500 

91560 

91620 

91680 

91740 

91800 

91860 

91920 

91980 

92040 

92100 

92160 

92220 

92280 

92340 

92400 

92460 

92520 

92580 

92640 

92700 

92760 

92820 

92880 

92940 

93000 

93060 

93120 

93180 

93240 

93300 

93360 

93420 

93480 

93540 

93600 

93660 

93720 

93780 

93840 

93900 

93960 

94020 

94080 

94140 

94200 

94260 

94320 

94380 

94440 
acacctgtgg agtgctaggt gatttgctaa tgttgggcaa catttgccca cgtggggttc 94500

ttggctcttt ggtaatagac atgcctagca ggagggcgga gcttggaggg gggagtcctg 94560 

gggttgcccg tggctccctg cagctggggt gtctggccag ctgaagaagg agccatggca 94620 

cgcaaatggg agagcatgga acagaggctg tggatgctaa gcaatatggg aggcagtcta 94680 

agcttggaag cagcaggtgt ctgggaacgg gcctgtggcc caggcagatt tccagtgagc 94740 

actccagttt tttggcacaa ggaacaagct ggctgagccc aagaggcaag tggtgataat 94800 

gaaacccgca gttgaggaac agcgggtaag ggtgccatgg gagcccatgt gctcatgaag 94860 

aggctggggt gtgaagaaga gcccatgcag ggaagccaca catcccctcg agttccaggc 94920 

agaggcagag tccctgagtg gggctccctg ggtctcccct tacctaacca gtctcccggc 94980 

accccagcaa acaaaatccc atccataatt tgaggtttat agagacctca aaggctgagc 95040 

tactgtgtgc cactaaccat cagcctaacc ctcccccact gtcttctcta gctgcccctc 95100 

tttcttctga gactgtgata gtggcgggga cgggttggga gtgtgtgtga agccctctcc 95160 

gactctccaa ccccagctga gccccttgtt ctgcagctca gtaacacagt aacacaggct 95220 

cagttctaca ctggttgaga acactcacgg ctctctcagc tccttagaga gcctgttttc 95280 

tcattttcct gtccccaaag cctagacaat ggctggtcca tttgtaagct tatctgagga 95340 

tgccaggggc caccccatgt ctccactagg ctggcaatgt tctctgtcac tgtagtacag 95400 

aagactgcct ggtgggaggt gagataagga aagggatggt ctcccctggg gttcccacac 95460 

agtgctgagc ggaaaatggc agaatgggct gggaggtaac tctgttgcta gagtacttgc 95520 

ctagcatgtg caaggaaggg cctgggttcc atccccagca ctacagaacc caggcgtggt 95580 

ggttcatgct ggtattctca acattcagga ggtacagtca ggaagagcag aagttcaagg 95640 

ccatcctcag ctacatagct agcttgagac cagcctgggc tatgtgagac tttgtctcca 95700 

acaaacaaca acaaagcagc agaaggccaa ctggcaagag gagtattacg taaagtaaat 95760 

ccatctcaaa aagcaagtag catgtatctt ctttcatttt tttttacatt ctataaaggc 95820 

ctatcaagtc atgtatatat gcatgtatgt ttgtatgata tgaaagtagg gggctggata 95880 

gatggctcag cagttgagag cacttggatg ctctttcaaa gaacctgggt tcaattccta 95940 

gcacccacat ggcagctcac aactgtctgt aattccagtc tcaggggatc tggcaccctc 96000 

acacagatat ccacgcacat aaacaccaat gcacataaaa taaataattt ttaaaaaaag 96060

aaattggaag taaaactctc taaggagaca aaagggactg aggggaagtg ggaggggcat 96120 

gaagggggag ggcataggtg tgtggtgtgt ttaacatgca gaatacactt ctataaaagc 96180 

tttggggttc attatgcaat gtatacatgt gtgggtgcaa gatgtaagct gtgcatatgt 96240 

gtgggggcca aaggtctcct cctcaatccc tctctgcctt attttcattt aaattataat 96300 

tattactatt agtgtgtggt gtgatgtgtg tgggtgtgtt aagccctcac ggcaatcaga 96360 

ggatgtctgt ggtctgagga tgctctctta ccatgttcgt gtgggttctg tggatggaac 96420 

tctggtagtc aggtttgcaa agctagtgtc tttatctgcc gagccacctt gctggccttc 96480 

aaccttattt tttgcattga acatggaact tcctgagttg cctggacagc aagtccccaa 96540 

gaccctcctg ttcctgcctc ccccntgtcn nnntcacang aggacacacn gcttantggg 96600 

tntccggatt gctgcncacc tccccgccnc ccnagcctcc tgcctccccg cccctcgccc 96660 

ccgctggncc ctcccccccc cccccccccc cccccccccc cttccccccc ccccnnnnnn 96720

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 96780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnncagaca cacacactca aattaaatat 96840 

agctctaact gctgttaaat tcacactcct tcacatcccc acgctaggac tctaaggagg 96900 

caccagcaag gcccaggtcc agcttgactt agagcaaagc atcctccccc ctccacacaa 96960 

tggaaacgga cggaaagggg catggaagca gaaccagaca acagcagcct agccaagccc 97020 

aggactctgc tccttccccc catgcctgcc gtgcaactgg ggaggcaaag ccccagccgg 97080 

tgctttctga ccgcttagcg gaagacaagg ggagcctgtg attatgattt ctgctgattt 97140 

gcaatgaaac actaatgcag tgggcttttc attaagccag atttattcaa tctaaagatt 97200 

ttatttcctt tatgtagaaa gtgcatcttt atatgttgtt ggaggagcag agatgtgata 97260 

aaaagaaatt tctcttatga actaatagca ctgatacata gtggtagcta tgcctaggcc 97320 

tctctctctc tctctctctc tgtctcctgt gcatgtgtgt gtgtgtgtgt gtgtgtgtgt 97380 

atgaatgcac acaaagtagc ccccccccat attatttctt ctgtgggatc tccagactca 97440 

gcaaatggtg gtgactggga agtctggcca tgcaattctt gccttttctc ttgccagccc 97500 

aatccctttg cattcaaacc cgggctgctt gctgtggcca gccctttcac ctggagtcct 97560 

tcctcctcct tacctgtctt cccatccttt gcagacaatt atcctcaata actagccaat 97620 

tacccttaag gacaattata ctcttccatc agcaaacacg ggtgttcttt ccttgagtct 97680 

tttgatgaag tcgatattaa agagatgctt tatttacata aagtcaaata gctccctttt 97740 

agaagggttt gggttcgatg tcaaagtttt aaaatcttaa ctagaggatg ggtgtagagg 97800 

gcttttggct agggtagaaa agagatggag atacttattc tgatgttgct ttaaaaggta 97860 

ggatgcccag agaaggtgga aggatggggg agggagggtc cctcctcaag ctaatgaatc 97920 

taaaagcagg gatgagctgg gcgctaggag tggaaccagt cagaagtgtc tgcctttgac 97980 

tgaccacagc tcctgccctc ccctccccca gtctctctgt gaaccgccag cattaggagc 98040 

taatcgcttc agaaagccag attggaatgt gttgctcacc ctccactgct cagaaaacct 98100 

ttattccagg caaggactga cccaaaccga tcatggcatc tgccaatcag gaggccaaag 98160 

gtgccggcag ggcgggacct agctgtgcag aaacagctcc gttatggcgc gcagaaaaag 98220 

ctggggggaa aggctaccgt tttatctctt ggcagatggc ttctctcttt gatgctttgg 98280 
gccttacctg ttactgcctg cacttgactt gacctaggca aaaatagcag cgagatacag 98340
gttctcgaag ttagaaggaa aaaaaaaaag ccccaaacca caacacaacc cggaagtgtg 98400
cccccgctgt gtttctaaag agctgttttc ttcccaagct ctacagcgtg gtggctctaa 98460
tcggaaattt ctttttaatc atagcaggag tcccaattag cgtgttgggt aatctttcaa 98520
gtagagtggg agttccgtgg ccacagagag cagaggcaat attcagcata aagccctaga 98580
gaaagaggtg ttgtgggcct gtgcacacat gtgtgtcaac gcacatgtgg cttgtggagg 98640
ctggcttccc actctcaaga tgaggtgtgt gcaccccagg ccttttgatt ctcaaagctt 98700
tattaggacc agagggactg tgtgtgtgga ggggtgttgc tcacagtgca gaaacccaaa 98760
cctggcttct ccaggagccc acatgccaac aaacaggctg cacactcttg ctagtacatc 98820
ccctaaaggt atggggatga gggaccaagt gctttgcaag acagcaggca cagagttctg 98880
ggacgctcct gtaccccaga ctcagccgcc acccagggcc agctctgatc tggcttgacc 98940
tactttcttc tgttgttgtt tttggaagtg ctgatgtcaa tgcagaattc agcagagtgg 99000
ctgagtgaga aaaaagagga gagggaggaa aagggggggg ggacgggacg ggccgaggcc 99060
aacaggaaag ggcaggcaac aagacaatga ccacaaggtc cctgtaacta cactaactgc 99120
ttacctttcc tgacccccag ggcttagcca atatagctga gacccagtct tggtgctgtg 99180
gcttcaggct aagtaaacag ggaagagttg gacatgggtc tccattctct ctcctcatcc 99240
aacaagggga ggaggcagtg gccaggcagc catgcccacc gatgccatcc ttctgggagg 99300
agccagacat ttcaggcacc tctccttccc tgggtgccta gaggtgctgt gtctgcatcc 99360
atctgccatg cctgccatct gagagaggcc actgggactt ggtagagagg ttctccacac 99420
atgctggcct ggaggaaatt ggtctttagg gacactgaag gcagtttcct ctgttcagtg 99480
gctccttgga aacccacgtg acagagctcg catgacaact tgccggctct caactcccat 99540
tcttagctgc ctcaagcact gtaaggttta ggagagcccc agatgtaagt atggatggga 99600
agaccctcca gggagtcatt gcctaccctt ctgaactcta acatggtcca gcttttccat 99660
tccacaattg aggagacgcc agacctggca ggggagcaag cctttgtttc tgacccattt 99720
gcaaacccca gccactgagg aacttgcata caagaaactg cctctgggcc tctcctggac 99780
tgagccctgc ctcccagggg acaactgggc aacagatcct tccaggtggc tgcagtgaca 99840
gatccatgct tttatgacat agaaaggcct cagtctcagg atttcacaca ctgtatttcc 99900
ctcatcctgg ggaccaggga aggcgagcat cttctgctcc ccccaaacaa gtgtgggaat 99960
gattaaaatc attttttttt tctgctccat gaactcatac agttttcaga taccgaggag 100020
acaaagccct cctgtgctga aattagaccc cgaaaaatag gttagctgac aattacttgt 100080
ttctaagtgg agtgtgatgt agtggcagga gcgcaggatg ggctgccagg gctgcagtct 100140
cccccccccc aaacttactg tctcttaacc tctcgagtcc ctgggtttct tgccgggatg 100200
ataattctcc ccatctccct cctctggtgg gctggtggaa agcgtaatga atcaacgctt 100260
gaagcacgct gaagaggcca gactcgggat gccatgtaag tacacagcat cgccagccac 100320
ctctcaagtc tacacggagc tgatttattt acctcccgtg aaagagacaa caatcatcat 100380
atttacactt catgccgcag cttcctgcgt ggcacggcag caccccctcc ctctccgctg 100440
ctgaggactc catcaagcac gctgccttgc caggatgaca gcagcccact ctcagcctct 100500
ccctggcctc cttacagatc atgacctcct gccccgtgag gtctgtcacc cgaaaaccac 100560
ggtacaccgg gggctgcagc ctctctatgg gggaggctga ggaaatgaat tccgtaggta 100620
aaaggcttcc taggaaatca gacgctgcta gtaattaagg agcgaagcat aggtgcgtga 100680
aaggtaaatg gatgttattt aaatgttgcg tcatttaaag agtgtcctgg tgcttcagtt 100740
ccttgttacc atgcagggct gtggacgggt ggcaattagg ctggcacggg tagagctcac 100800
ctgctgagct gagggagggt ggggacacac cttccggtaa ttgctgctgg gcagctctgg 100860
gtctccccac ccccgccccc gccctcactc cccacccccc acttctttcc tgacagctct 100920
ttcatttgca gcagcttaca gggcttgttg cccttaccca gaaaatcacg ttggaagaaa 100980
tataagaaaa agaggaatga aagagaaagc cagaaaagtt catattaggt tcggatctgc 101040
ggccaaacct ggccgagaga atccatgacg gtccgcgcgc atataaccct gtggcaacag 101100
ggcccggcac aacagggccc gccacaagag cttcttgagt tgccacctgc caggagacag 101160
gatgaatgaa tggatcatct gtccttagag cacaagccag gcctgattct ccaatattga 101220
tgtgtgaggg agatgtcaac agaggttccc taaagaatga tgcttctatt tccatgctaa 101280
tcctggggcg tcagcttcag tcggaacagc cggaccgtta ccttagctct gctgttctcc 101340
tgtctgtaac ccgcagaggg aagggcgggg tcacccagca ttgccactcc ccccaccctc 101400
acgtggtcca gacccctctt gggttgatct gctcctgaaa aacagtgttg gctcaagttt 101460
gcctctgaag gtatgtcacc gctggctcag ccagcttatc tccccggtgc tttcaagatc 101520
aaaacaccca aacgaaagaa aaactttgtt tcaagagcag agtgtggtgc caactctgat 101580
caaagtgttt ttcagcatga caactcactg cccgtgacaa ccagtacttg gctgttgtgg 101640
ctcagagtga gatgcggagg gaagtggatg acaacagctg tatccaggtc caaacagagt 101700
agattcacgg ctggcagaaa atggctgaga gccttgggct gcatccctcc tcccctcctg 101760
cctctctctc ttttcaaggt ggtttttgga aatgtccttc ctgtgggttg tgtgcctttt 101820
ccatgtagga cctggggcct gtgcagatgg ccctgtgttc ctggtgctgc tgttgagatg 101880
tgaacgagtg ataggaaccc aggcactaaa cacacaatgt ggttgtatct gactagaagc 101940
aaggcaagag caggaggcat ttgagggtaa aggagtgtaa ggactgtgta aagagatgag 102000
ggttctatct gggaggcagg agtcccaatg ccagcaaata caatggactc tcctggtcga 102060
cccaaccaga gagaattcaa gatggcagag ggacaggctg tctgagtttc ctatggctgc 102120
accgataaat ggtcataagc agagtagagg aaaaccacag acagaaattc atgccattga 102180
gactagaaat ctagctcaag gttgtgtgtg gcagggttgg ttcctgggtg ttcaaccttt 102240
tcacactgtg acatgatgct gtggtctgca gatgtgttgg gctgcatcca tagctaccct 102300
gggacacatt catggaccgc aggttacaca tgctatttaa aaactccaag ggaagggcta 102360
gagaaatggc ctggtagtta agtatgcttg ctgatcttcc agaagacctg agctctgttc 102420
ctagcagcca tgttgggcag cttacaacta actatgactt ctgagctcca aagctctctt 102480
ctaatacata catacataca tacatacata catacataca tacatacata cgtacacaca 102540
cacacacaca cacacacaca cacacacact ttaaagaaaa aaattctggg ttggagaggt 102600
ggctcagcaa ttaagagcac tgactgctct tacagaggtg ctgagttcaa ctctcaacca 102660
catggtggct cacaaccatc tgtaatggga tctgatgccc tcttctggtg tgcgctgaag 102720
acagatacaa tgtactcata tacattaaat aaataaataa gaaagaaaga aagaaagaaa 102780
gaaagaaaga aagaaagtgt aaacgaggaa aattcctaat taaaaaagaa agaaagaaag 102840
aaagaaagaa agaaagaaag aaggaaagga attctgaggg agaatctgcc ccttttccta 102900
acttccaggg ctataggcaa cctgtggcct ggggaagctg tagacaacct gtggcctggg 102960
gaagctgtag acaacctgtg gcctggggaa gctgtagaca acctgtggcc tggggaagct 103020
gtagacaacc tgtggcctgg ggaagctgta gacaacctgt ggcctgtggc agcatcatgt 103080
caacgcctca ccctctgtgc ccaatttcct gttctctaag gacacatgcc atcaaatgca 103140
taggacactc tacatcaaga tgatcttgtc tcaagatgtt taacaaaatt acatctgcaa 103200
agacctatct ttacatgtga ggtcactcca caggttctag acatattttt gaggagccac 103260
catccaactc actatgtgac agagtcatct agagatttgt gtccaggaca gactggctgt 103320
atctgctctg agagtcccct gcctgcccgt gggaactccc cagtggtcct taagggccct 103380
gaggactttg gatctgcaaa gccacatctt ccaaaaccat tttcctcttt tggagagcta 103440
ctctaccctg aaaccctttt ctctgaggtg gcttttagag aggcaggtct cagcagggca 103500
ctgtgcccac aagaagtccc ggggagaagg gacccaaggg ccagtgctga actatcgctg 103560
agactgagaa cattgtgtct cacctaaaat cggtggtcgc aaggaccaag caggctctat 103620
aaatgtctta ctgcctttat tccttttcct ccgctccatc ttactcctca tttttgtttg 103680
tttgtgtgtt tgtttgtttt cttctgagat gtagcccagg ctggccttca gctcactatg 103740
taactaagga tgactttaaa cttctgatcc tttcttccct ccacttccag agtcctgggg 103800
caggtgtgtg ccaccgtacc ccagctttat ttgagactat gattcaggct ccatacttca 103860
tgcatattag gtaagcatgc taccaacttg gctatattcc cagcctttct ttctttcttc 103920
tttgagacaa tgtctttttt tttttaatta tatgagtaca ctgtatctgt tttcagacac 103980
accagaagaa ggcattggat cctattagag atggttgtga gccaccatgt ggttgttggg 104040
atttgaactc aggacctctg gaagagcagt cagtgctttt aaccgctgag ccatctcgcc 104100
agtccttgag acaatgtctt gctatatggc acatattggc ctcaaactca gaatccttcc 104160
gcttcagcct cctaaatact gggattacat gtgagccatg gtgtttggct tctagccttt 104220
cttccttccc tttcccttcc cttttccctt ccctttccct tttccctttc ctttccttcc 104280
cttcccttcc cttcccttcc cttcccctcc cctcccttcc cttcccttcc cttcccttcc 104340
cttcctttcc ctctctctct ctctccccct ctttcttttc tttcagagag tttctctgtg 104400
taatcctggc tgtcttggaa cttgctctgt agaccaggct ggcttgnnnn nnnnnnnnnn 104460
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 104520
nnnnnnnnnn nnnnnnnnnn nnnnnnaaga agaagaagaa gaagaagaag acaacaacga 104580
cgacaacacc ggcgccgctg cctccactgc catccacctg agacaggact caaatccaga 104640
ccaattttta aaaccagtgt ttcaagccgg tacactgaag tagtagtccc acttgggatt 104700
atagctcctt actttgtttt gctttgacgt tttgtgacat ggtgtgatgt agtcttggct 104760
gtcctagaac tcaatgtgta aattaggctg gccttgaact tgcttctgcc tcctgctggg 104820
atgatagact gatggtgtaa aactccactt aggaggcaga ggtgggagga tcagaaattc 104880
aaagtcatcc ttggctatgt tgtgagtttg aggaccaacc ttggctacat gatatcctat 104940
ctcaaaaaga aataaatgta ttgccgggca tggtggcaca cgcctttaat cccagcactt 105000
gggaggcaga ggcaggcaga ttttctgagt tcgaagccag cctggtctac agagtgagtt 105060
ccaggacagc cagggctaca cagagaaacc ctgtctccaa aaacaaaaaa caaacaaaaa 105120
agtgaacccc aacagtactg ccggacagtc tggtgtcttt cctaagtctc ctttcaactc 105180
tgtttaccca ggtgtaccca caaggtgtgt gagcagctct atacccagag gtgatacggt 105240
tgtttgaatg agagaaaagt ttcccatcag ctcgggtgtg tgaatactcg gtccccagtt 105300
ggcagtattg gctggagagg tgatggggag gtgtagcctt ccggtggaga tgggctttgg 105360
gagtttaaag cttcctccac gaagaccact gggctgatct tgttaccaac agaattggat 105420
tggcctgctc ttggctccgg cctcagttta tctaaaattt acatgttacc tgatcaaaaa 105480
ctgtttcctc ccccacccct ctccctgtct gtattccctg ccctctagtg gtgctggctg 105540
tatactacac tggtgatctt gactgtattt cagtttacct cttgttctct ctctgctgac 105600
tctagagatg tcctttcatg gctgggacct ggctcagaaa tttctaaggc actgagcctt 105660
cctccatctg aactttagga aacttcttgg cttaaaggtg tatttctgac ttagtatgca 105720
atgagacctt ggagcctgca ctttgttaag caccctgggt ggtggtggtg gtggtggtgg 105780
tggtggtggc agtagtagaa ctctgatgaa cagttagtta ttcaaggccc atctagaaaa 105840
agaaaggctt tgggtgcaca tggctataac tcagtggcga gagacgtgct tcccatgaat 105900
aataatgatg acgatgaaaa taattctctg tgactgttct ccccacttcc ctctctctca 105960
ccctagctct tatctaccga atccctgcac aagcacccat ggggtttaca gaatctgggg 106020
cggaacgtta gtcacttccc ttcgcctact tcagtattgt gtttccagaa gtacccattt 106080
tggctagtca ctgaggaaaa cggcagctgc ctgtgggcca ccagcccatg ccaagtgagg 106140
tcagcaagaa agaagctgac agcaaatgtg ccaactgtgg gtctgctgga tttctactgt 106200
gctaagtggt ttcaagaagt ttcttcttaa ccccctacaa gaaaccacaa attttattat 106260
ctacactgtt ttgtagatga agaaaacacc attccgaagc tcactgccag tcagcactgg 106320
aactggaatt tgtttggcta attcagtggt tctcaaccag ggtggatttg gactccccag 106380
gggatatttg gggacacttc tggtcgtcat aattgggatc atgtgctact ggcacctagg 106440
gtagaggcca gggtggtact gaccttccta ctatgaacag ggcagccatg tacataaatt 106500
ctctcgatca aaacatcaac agtgctaagg ttgagaaatc tcaggctgaa accctgtcat 106560
ttggccttga ggtgggtggg aggaggttag agagtggaat aaaatcagaa gggccaccac 106620
agaggcctcg agtgaggagg gaacagggct cctatgctag ggataatgga gaatagggca 106680
gcttgttgaa acttttcttt cttccaagct tggctagagc cctgctcaat ttcccccaac 106740
tctgccaagt cagtcccggg acttcgcact aagtttgtcc tggagtgacc ctgactccag 106800
cttggagctg gggcaacaca tttacctggg tcttcccagg agtgggttaa aagtcaaaga 106860
taagtggcat ctgagaagtt aaaagtgggg tgagtgggta aaggcaggag gaccccatca 106920
atcagctgac ctaggaagga agagagcaac tgaagcagca aagagctggt ccaggagact 106980
ggattctgac ccactaggct tatcttccac agcctttctt gtttaggctt gggctcagtt 107040
tccctgcatc atccgcagga gcccctggag cacccacatt cagccggccg ccaggacagg 107100
ctccccagca gtggcctccc cactaactga cagtggtgac aggaaatatc tcccattcca 107160
atctcctcag aagtctgaat aagtaaggga cagatgttgg ggagaggcgt cactcttggg 107220
ttgatgaaga aaagatcatg agaagcatac attttacccg ctatggttgg ggttccatta 107280
ccacccatgg cggggttgag ggggaagggc agaaaaaagg agatggagaa ggacagacac 107340
gtaagaagga ggatgtggtt agggctgatt cgtcccctgg ggtcgaacaa tcagctgtta 107400
ctggggcgga aggcaggaag tctccttaaa gacacaatat tctgaacgtt gaactcagga 107460
tttgaagcaa gcccagcagt caccttagtg gagcccatac ttaatttaac acagaagcgg 107520
ctatctcagg cttccctctc atttctttgt ctcagatgct ctcacttaga aagcttagat 107580
gctcttagaa atgactcaaa agtcaagaac cccaggccac aagtttctct ttgggggtgg 107640
ggagggagtg aagaggtggt cccagtcttg tccctttaaa taagcaattc agcagctttt 107700
gccaagtcat tgggttcatt tcggtttttg cccatccccc gcctttcaga ctctgattgg 107760
cccctaggga aggagccgcc tcttcattgg tctccacctt tgaaatcact tccctaagta 107820
ggcctgagtc agagaagcgt ttcggagggc gggactgaat gggtgttaat cttagaaccg 107880
ggtttctggt tgatactact ttggtaaaga tcttccccta atttttaaaa agacgcttcc 107940
tctctaaaag tgagggcgaa tcctttgtta agaacgtgcc ccttgagaag ccgtgggctc 108000
ttcagcgact aagacgagac attcactaga aaagatttca ctaaacccac gagggataga 108060
ctagacctcc agtgaagatt gggcctgtgc gggtgacatt tgtccctata ccccgaagac 108120
ctcgagctag ctctccagtg aagactgggg ccgtgcgagt gacagtggtc cctatacccc 108180
gaaaaaaaaa aagtcctatt tgtggaaaaa aaaaaagact tcgggtgttc tgctgcatcg 108240
gtggctggct tccatcttta gttctactca ctcctgttgc ttcgcgtgct ccaccttcgc 108300
ttagctcagg cctcctgtga atcagttttg aggctaaaag aagttccaag aaggaggggc 108360
tgtagccctt taaggacttc cccgcgaccg agtcagagat cagtttaaaa atgccaactc 108420
acagagcgcg ctgcattctg ggaagctgag tgtcaccgta agaacttcat tgaccggaat 108480
gcactgcaaa aatacacgcc tatacttcct tctgctcttt aaactgtagt ttgacgtaaa 108540
gctggtctaa gcaagtcgcc taggccgagg gttagccaca ccttttcagc cattggccag 108600
ttggttagtt ggtaggcgtg gcttagagaa gctcctccag gcaagggggt ggcctccttg 108660
ccaatcagag cccagacgcc tgaatgggcg ggagtaagca gaggtgctgg cgcccccgag 108720
tgggtgtggt cacgttgccc agcaatgggc ggtgattggc cctgggtggt tcattcgcag 108780
ctcgtgcgtc acgacgccgc cagctgatcg gagactggag ccggtgtgtg ctgggcgctg 108840
ggaagagaca gagcggtcgg ccgtgcggac aggtcgcagt gattttgctc ctctgtccac 108900
agcaaccccc gcacccagca tcaggtgggt gtgatctggg gacccggtca tcccgggggg 108960
aaccgcggta accgggtgat ggggaaagta gggtcctgac ggccacaccc tgcccttctg 109020
ggggagggga gagggggcgg cggggacagg ggcgctcttg ggagaggagc ctggactctc 109080
ccgagtagtg tgtctggacg tttaaagaga gagtcccgga caggagtcgt ggcagaaggt 109140
ttggagaagt aactggggag gaatatgaga ggccagaggg ccgggggcgt ctaaccccga 109200
cgccctttgg tttgaggatg cccgagctga ccatttagcc tagggaggat ctggacgagc 109260
gaggggtgcg gaggtgcatt gcctctaccg gcgctgactg ggtcagggcc agttcaagtc 109320
cctggcaggg aaggggtcgc tgggcggtcc ggcccctcct ctcgttccct cccggggatg 109380
ttatgtaagg ggggagggga aaggagtagg gggcggcggt gcggaggcct tatgcaaccc 109440
aaaggttagg gtttcaccgc gggttgggcg gaggttgggg ggggcggaca ggaggagtgc 109500
ctggaaactc tacccgcacc ccccctccca gcctaactgg ctgtcttgga cagagagaag 109560
gtcacctttg cacctccccc ctagtatgtc cggtagagag gcccctagcc cgggcttggc 109620
ctgactgcct gggaagccgg ctggctgggt ggggcgcctg ggttagtcat cgctgggctc 109680
cctctctccc cacctcctgg ccaactcttg gcccctcccc acggcctccg gttaggctaa 109740
cgttcccacc tccctctggc cctagtttca gtctccaact catttggcct gtcaccctgg 109800
ctgttagagt aggctagaag ctgtcatggt gccagagagt tgatggagca gctggtcaga 109860
gggtcagtgc cctgggccca ccccgccccg cagccaaggg cacctgcttg gcacaaactc 109920
tcagcagcca gtgaaccctg tggcctgaac agagctatcc tgggcagaga gaagtggaca 109980
gagactgatc acctaggaga aggaagatcc gacaaagttt atacttccca agaggctttt 110040
ggaatttgaa agttgcccac cctagtgtaa tctttccact ctctgaaaat agaaatccca 110100
aggcaaagtc tccttggccc ttctatctgg cagtggccat gtccttggac tgactgtgca 110160
gaaccaccct ctcgggctcc cagccctcta gcctgccacg cccccagccc cctccctgag 110220
ccatgctgta gggccccggc ttttactgct gattcatgcg ttggaactgt gggggcgggg 110280
cttggaactt ggaacaaagt tcagacgtgg aggggccggc agacagcctg gaattcatac 110340
cagatgtacc cggaatgtgc aagcggaatg cctggcatct ctagtcctga ggaagctgcc 110400
cagccaccct acccatacct ccctcccctc ctgcctttgg tcagctgtcc tccctcagac 110460
tcctgagagc ccctgctgac cttccaactc tagtgcccct cccatttcta accctacaca 110520
aaccctcctt gctgctgaat tccctaagaa caagtcattt gagttgatca cagagctcat 110580
atttctgaag tacatttttt tttttaactt gggacttggg ttctacaccc tgccctttga 110640
atgccgaaga tgctgggctc cttagcaggt tgccaagagt tgccagctcc tagtctgtaa 110700
aggggcacaa agcaagtgca tttagaagcc tcttgcttct tattcaagaa cccctcatta 110760
gaaggtactg aaagtcagct agagccaggt ttggatggcc tctgggtcgc tggccctgtc 110820
acccagcttt cctgtttttt tttttcctcc ccttcctttt aggaacctgt gcctcccaca 110880
ccctcacctg gctgagccgc agtagttctt cagtggcaag ctttatgtcc tgacccagct 110940
aaagctgcca gttgaagaac tgttgccctc tgcccctggc ttcgtggagg aagaggagaa 111000
gcagcagctt tgcctatcat ccggaaggtg acagaactgg ggtgggaagg tctggacagc 111060
tggggtgatg gctttatggg agggaaaccc tggtcctctg gggagccctt acccccactg 111120
gcccagtgaa agatttaggt taaaggcact gtctataaat tggggaatag gtgactccac 111180
ctccccaaga ttagttgatg tctgtgtggc agtgggaaga aatagaagga aaagtctgtc 111240
tgtttactga gacttccttg taggcctgcc tttcttatct tcatcatcac catgccaaca 111300
cacacacaca cacacacaca cacacacaca catacacaca catacacaca cacacacaca 111360
cacacacttt cctttccatg aggtccaaaa gtaaatgtac tcaggaaggg ggacattgaa 111420
actccgttct aagtagtcat ttgtgtattt actttttttg tttatttgtt tgattgactt 111480
tcgagacagg gtttctctgt atagccctgg ctgtcctgga actcactttg tagactaggc 111540
tggcctcgaa ctcagaaatc tgcctgcctc tgcctcccaa gtgctgggat taaaggcgtg 111600
tgccaccacc gcccggctgt atttacattt ctttatttat ttttagtctg gcccagattt 111660
tgggtttagg ggtacttacc cttacacctg tggatttttc cacctgtata atggggaatc 111720
ccatagataa gtaggcagga gggcattaaa agtccaccag tggtgactca gagcctgggc 111780
tcttcttctt ctcgtggatg gaaacgaaac agctcttcac atgaactgtt gtccttcccc 111840
caccccctga ctactcaccc agctcagggg gattaggatg gaaggaaagg ctatggttaa 111900
gtcccaggca agctcgtggg aggctagtcc tctactggct tctcaccatg catgggtggt 111960
ccaaggcttt ccctccacct aaagcaaaac tgtagctctt ggttgggttc tagcaaccac 112020
tgccatttat tttctgcctt tgctttccag gatagtgaga ctctgctcaa tactgtgcag 112080
gcaagaaatt gtcaggggag atgggttgta tgatatgagt cccttctgct gcctctagct 112140
cctgattcat tctcacgtat gggcttggtc tctgattgtg gttcaccttt ggcccagtct 112200
tcctaacaga agatgggttc agggggtaca ggaggctgtt tgttgtattt gacaggagga 112260
ggagttctag cctgttcccc atttgtgaga aactgaaagt cataggggag actagatcat 112320
ctaatccagc cccactgcag tctaagctga gggataggat gtgtaaggga ctgtagcaga 112380
cgggctgggg aggctgagtc ggctcacaca ttgcgacaaa gattgccctt ccctcgacct 112440
cgcttgcttt ctttcctcct cccttccctg gccacagtgt gtccctccag cactgggtac 112500
atggctctgc tgtcctcatc caacatggag cctcagaggt gagaaagggc agcctggaag 112560
caacagaggc aggcacaaga cagtggagga cctggcctgg aaccacaagg gcctatccgg 112620
acattggtca gagaggcacg tagaagcctg gagaacacca ggaaagagag cagccagcca 112680
gcctcagtga aagacacgtg cttccagcca tctcctctca ggacctgcct tcctgggaga 112740
tgaagggcct ccaggaagta tggtcccatc tctaccctgc agtttctata aacagcctca 112800
aggagcatga gccacctctg aaaggaaata cacagcaaat tcaaaaagag attcaaatgt 112860
gtaacactgt gggaaaacat atctatgact ggggttgtag ctcagttggt aggtttgctt 112920
aacatgcacc aagccctggt tctgtcttct gcattgcata aaactgaaca ggttggccca 112980
ggtctgcaat cccggcactc tggaggtggt ggcaaaggag cctacattca aggtaatcct 113040
ctgctataca atgagttctg agccagcctg ggctatatga gactgtctca aaaaataaaa 113100
caaaataaaa taaagcattg gttagtaatt caaagaaagc agatgtggct gaaaccgttt 113160
tccctgatca taatacaaca agcaaatgaa agccagaaga aggctcctgt gccttgtgtg 113220
tggcagtacc aaccattgtg agagatgcct ttggacctgg tagtttgctg tcttagaaat 113280
gtatcctaaa ataaggattt ggttataaaa tgttcatctc agggttgtaa tagagaaaaa 113340
tggaacgcag ctgtttgttt ggaagtccat tccttttctg ctgtcatgaa aatgtatagc 113400
tagggcttgc ctaagtaaat tatattcatc tgatggtggt gttctgtgca gccatccaaa 113460
gtcttacaga agaaaaattg agtggaaata taaatattga atactaaaaa gattataaaa 113520
gtatgagttt gtgactgttt ttaaaatatg aacacatact tgtaatatat ttttttaaaa 113580
accatccaat tgagtggaaa tataaatact gaatactaaa aagattatga aaagtatgag 113640
tttgtgactg
aaagtggatt 
attttcccct 
ttaaataaaa 
tgcctgtgag 
gccgtctctt 
ctttgtgcac 
ctggagctgc 
ctagaagagc 
catgtttaga 
aagaaaatgt 
tgtgtgtgta 
ctccaccaca 
tacacacagg 
agaataagaa 
nnnnnnnnnn
nnnnnnnnnn 
aacatagctt 
ttaattttta 
gttaatgcaa 
taaaagtcat 
atgcacacat 
taaataattt 
tgctgctttt 
ttagaaaagt 
cgtcagcaca 
caactcactt 
cacctgaagg 
aagccttagc 
tatttctcga 
caaaaaaatc 
ggttaggggc 
actctgacta 
ggttgtatag 
aaagatttat 
gatcccatta 
ttggaagagc 
taaaagttta 
tgtgtttgaa 
ttgattcttt 
cagctgagct 
agaaggaatt 
gtatgatatg 
cagatcacta 
taatcaaaat 
cctgtaacct 
aggacaacag 
gaacaaaacc 
actctgggac 
agcttccacc 
ctagcggaga 
gggtgcatga 
gaagccatgg 
gtaatgaatg 
gaaacttgga 
ttaaaagaaa 
aagcatcaac 
tcgggctggt 
ttcaaatccc 
ttctggagtg 
aagtattaaa 
ttttgttcct 
tgctaaatag 
tggcctaggc 



tttttaaaat 
caaaatgtta 
tgttttttct 
tacaaaagac 
tggcaggaat 
cactccctct 
cagagtgtgg 
acgctgtttg 
agcaaatgct 
ttttggagaa 
ttgcttggga 
tgtgtgtgtg 
ctaaagtcaa 
aagcattcct 
agaagggaaa 
nnnnnnnnnn
nnnnnnnnnn 
aaagaattct 
atgggaaaaa 
acaggacaga 
ctatttatac 
acaaacagag 
taagctctaa 
gccattcttt 
aaggagcaca 
cagcaatgtc 
ggactggtac 
ctgcagcagg 
tgcagtggaa 
agtgaatgta 
caacttttgt 
acaagctccc 
ctacacctgg 
caaattttta 
ttttatgagt 
cagatggttg 
agtcagtgct 
aaattaattc 
gtatatatgg 
cctttcactg 
atcttgtggg 
catgtaattg 
gtaagaaaaa 
tgttcaagga 
gatcccatcc 
caagtccccg 
gatatctgag 
aacaacttca 
acagtgtcac 
tgggacacaa 
ggctgcaagg 
tgtgaaactt 
ccaacaaccc 
ctatgtctaa 
cagaaaagcc 
tttcacaaaa 
taagaggagt 
gagatggctc 
aacaaccaca 
tctgaaggca 
aaatatgcaa 
gttgtttgtt 
tggaagatga 
tactctgcct 



atgaatgcat 
agaatggttg 
gtctttctaa 
aaataaatgt 
ccaacattgt 
gtaattttta 
gtgcacattc 
gatcttctga 
cctagccact 
tttgcttgta 
tacctattgc 
tacactggag 
ttctatgaac 
ctgactttct 
tgctcaaaac 
nnnnnnnnnn 
nnnnnncccc 
agagaaaaaa 
gtaatatcaa 
ttatcttggc 
gagatgaaat 
tctttttatt 
aatttaaaaa 
ctgataaaag 
tcctatgagc 
ctggaccatt 
ggggtgaggc 
ccagttccgg 
cctatgcaaa 
caggcaccat 
gagacaggct 
aaggagtatc 
cttttaggtg 
ttcaccaagc 
acagtgtggc 
tgagccacca 
cttaaccgct 
tcattatgca 
ggatgtagtt 
tgggtcctcg 
ccaccaaact 
aaatagaaaa 
ccaagaaaaa 
tggtgtccca 
agacctgcta 
ggcagcacag 
aaggagtccc 
ccatgatgaa 
acactacagc 
tgtcacacac 
gtggagggca 
acaaagaacc 
cagcactctg 
agagatgata 
aaatgtaaca 
aggcacagaa 
caggaacagt 
agcggttaag 
tggtggctca 
gctacaagtg 
taattggaat 
ttccgagcct 
ccttgaactt 
gactcaggct 



acttgtaata 

tttttgtatg 

tttttaaata 

tttaaaattt 

cttctgaaag 

aaaaatatat 

tgcagaggcc 

tgtggatgct 

aaagccatct 

ggaggatggg 

ttcttgagtg 

ctaggaatca 

ttcattaatt 

gactgtcagc 

ctgtcactct 

nnnnnnnnnn 

tattttacca 

aaaaaagtcc 

agaaatccat 

caaattaaac 

gtgaattagc 

cttaagccta 

atagtgaaac 

tcccaaaggg 

tggtgacctg 

tgtgaggagc 

tggcgatggg

caggttttcc 

cacacaggaa

ggcacgcatc 

cccttcattg 

tgctctccta 

ggttctgagg 

cagacaaccg 

tgtcttcaga 

tgtggttgct 

gagccatctc 

cgtgccacag 

atgtgacggc 

gagcgtcaat 

tatttttaaa 

gcagagttga 

acctctacaa 

gaaaagggtc 

cagagaagct 

cagagctgtg 

agtgagcgtg 

catcatgaag 

ttccacctgg 

tacagcttcc 

ggtacaaggg 

aataagttag 

gaggcaggca 

aagtcccatg 

aatgcatggt 

tgtaaggtga 

gtattttaac 

agcaccgact 

caaccatcca 

tacttacata 

tattctttgc 

gtttcatgta 

ctgattcccc 

cagtgaattt 



tattttttaa 
gtgggatagt 
ttgtgcattg 
ttactctttt 
cagctagcgc 
ttgtatgtta 
agaagagggc 
cagaatcgca 
ctccctctag 
ctacaccaag 
tgtgtgtgca 
tatccagggg 
gtctgaatcc 
cagctaagga 
ntggggnnnn 
nnnnnnnnnn 
agcattgcag 
actaatgttc 
gtctacggca 
cacagtaaca 
gtgcgcgcgc 
ccaaataact 
cccttcaggc 
tgtcataatc 
agttttacac 
cccacgctgg 
caccactttg 
aaatggcttt 
aagggcagtc 
tggaggtcag 
tcttcccacc 
agggcactgc 
atttgaacgc 
ttttttaact 
cacaccagaa 
gggaattgaa 
tccagccctt 
ggaagtgtaa 
cagaggacaa 
agggtcagca 
aagctagaag 
agagggagga 
gtaaatctag 
ctaaaacaaa 
ggggggaagg 
cccaatgatc 
tagcagaagc 
atgtgtggac 
gacaaaaagt 
acagtcagat 
aacaggggtg 
gaaagagaga 
ggagggtctc 
agagttacaa 
aggtggggaa 
aagcgagcga 
tacctttata 
gctcttccaa 
taacaaaatc 
taataagtaa 
aattctacta 
atccaaactg 
tgtctccacc 
tcaaacactg 



aaaaacactg 
acaattgtga 
ctttcatatg 
atgagtgttt 
taacttctga 
tattatgtgt 
atcagattcc 
ctcaggtcct 
tcctcattgt 
tgccaggtga 
tgcttgtgtg 
ccttttcaag 
acctactctc 
ggtgtggctt 
nnnnnnnnnn 
nnnnnnnnnn 
gataaataag 
aatgttttaa 
tattaatgtg 
cgttttgaac 
aggtgcacac 
ctttaagcag 
tatgacagaa 
tgtatccttc 
ccaagtcaca 
tcctgagcaa 
gactgtgtcc 
gaggcaccat 
attcccgttt 
atgataatct 
acacttgcca 
catcagagac 
agatcatcat 
ttttttaaaa 
gaagacatca 
ctcaggacct 
atttaacttt 
cataagcaca 
ctctgtgaag 
ggcattttac 
ttggttaaag 
gaggaggtga 
tcgtgttcca 
gcccaggaag 
agggagaggc 
tccgtgatac 
cagaggcctc 
aaaagggtgc 
cacacactac 
tatttttctt 
agtgggatcg 
tgaagacata 
catgccaaag 
agtcagagga 
aggtggggat 
ctaccagata 
aaaaaaatac 
tggtcctgag 
tgatacggtc 
ataaataaat 
tgaatggagt
gcctggaacc 
tcccaagtgc 
cttcaacctt 



113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
114240 
114300 
114360 
114420 
114480 
114540 
114600 
114660 
114720 
114780 
114840 
114900 
114960 
115020 
115080 
115140 
115200 
115260 
115320 
115380 
115440 
115500 
115560 
115620 
115680 
115740 
115800 
115860 
115920 
115980 
116040 
116100 
116160 
116220 
116280 
116340 
116400 
116460 
116520 
116580 
116640 
116700 
116760 
116820 
116880 
116940 
117000 
117060 
117120 
117180 
117240 
117300 
117360 
117420 
117480 
gacccctaac
ctggggcttg 
ctgctgctgc 
gtgcaaaccc 
tggctcttag 
tttttgtctg 
gctttcagtt 
agtaaagtca 
tcatcaacag 
gaaaacctaa 
gtttacaaag 
tggtggaaaa 
gcagggagag 
aggaggggga 
tgtatttgcc 
aattgcctac 
tggctctgaa 
caaaggcagc 
acaggcagga 
cctgctgggc 
tttggcaaaa 
aatacattca 
acagagagaa 
agcccttcct 
cagtctatgt 
tgaaggaaag 
acctatcttc 
gcacctcacg 
tctccttcca 
cctgctgagc 
agcctcattc 
gactcaattt 
gacccgagtt 
gggagcagag 
gttcagtaag 
ccgctgcagt 
ttgaagctgg 
acagtaaaca 
tttaatccca 
tctacaaagt 
aaaaaaaaaa 
aaattttcat 
aagtctccaa 
agcagtcgct 
aagaggacca 
tgggatgcgt 
ggaacaagag 
catctttaac 
agttcatgta 
ctctaattat 
agtaaccttt 
gaattccaga 
agctgcccac 
aaaaccattc 
tgcaaatcaa 
tggtcccaag 
agctatcatt 
caaacccacc 
agcctgccat 
ctctgccagg 
ggtgaggccc 
caatatttct 
tatgaataat 
acagcacaga 



ttccatctct 
ctggctggtg 
tgctgctgct 
agtcaagaga 
aggagcagac 
cgcatagtaa 
ttaatgaggt 
ctgggctcaa 
gacggtgtgt 
aagcagttca 
ggttctaccc 
tatgagttga 
atgagaacag 
ccgagcacgc 
cttcaccctt 
cactatttct 
ggctgaagcc 
taaaaaaaaa 
agtctccctt 
ctcctgaggt 
ttactcttaa 
gcacaaagtt 
aaagtgagac 
ttacagctca 
gaagcatttt 
tagaaatgag 
cagcgaagtt 
aaggtgcttg 
ccagagtcct 
cctttcccca 
tcagaaccat 
ctataaaaag 
ttgttcccac 
gtaggcagat 
ggagcctgta 
ggcatctgct 
aggtcaaggc 
tatattttac 
gcactcggga 
gagttccagg 
aaaaaaaaaa 
tttttataaa 
cttgataatc 
aagtccgggt 
gaagttcaag 
gagacacacc 
ttaagtgcgt 
atctttggct 
cttcatcatc 
atagaacata 
atctcttgca 
gcattgtaaa 
aagacctcaa 
tacaaagcag 
gctatcccag 
ccatgaggga 
cttacagtct 
aacttcccaa 
ggtagtgcac 
agttccatgc 
tgtctcaact 
cagttcctac 
aagtatatta 
tacaggacac 



tgggtcccat 
gtgtcgctgc 
gctgctgccc 
tttcaacacc 
tgcaccacgc 
agactggttc 
tctccctgaa 
tctgtgatac 
tcgtggcagg 
attgtctctc 
tttcccccaa 
aagagtctca 
agttcagaag 
ttctaaagca 
tatacaaatg 
aaaataacga 
agcctcgttt 
aaaatgtccc 
gccactgagg 
cccaactcac 
gtagctcaag 
gagaagatta 
acctatttac 
gccttcccat 
taaaatcagc 
cacacagtga 
cttgccttct 
cagctatgtg 
agggatggca 
gcctctcctt 
ttctgctgcg 
ataactgagg 
atggggcgtg 
cccaggcctc 
accacagaag 
catagccaca 
cagcataggt 
tacaattaaa 
ggcagaggca 
acagccaggg 
aaaaaaccaa 
aattacagtt 
tgaggctggg 
agtcccttgc 
gtgctcctcc 
agtaacagca 
tactcccgta 
gagaagagaa 
atgcagttaa 
agatgtggta 
aatctggtac 
tagcagcaat 
atcaatcagt 
taacgtaggc 
gaagccagat 
actagattac 
attatggcaa 
gcaagcattt 
acctgtaatc 
cagcctggtc 
cccaaaccac 
tgacagaaaa 
tacagtacct 
acacacagct 



cttacccatt 

agaggctggg 

atatcctgga 

agcagtaaat 

tgcacaccct 

catttccctt 

gccttgccca 

taccactatt 

atccattaga 

aagtgctttg 

acaaagcaac 

ggtaccgtga 

cagccgtgtc 

gcaaataaga 

tttaccaagt 

gtttatgggg 

aggctgctca 

agagctcctg 

ctctcccttc 

cgggcatgct 

gaatgagacg 

aatagaattg 

ataggagagg 

tatcactcaa 

aagagaaagg 

cctgctgcat 

ttaaaaagaa

gagggcagag 

caaaggcacc 

ttctacagtc 

catgtctgac 

agccggcttg 

cgtgctagag 

aatagccagg 

acactgtgcc 

gctacccagg 

accataggta 

aaaacaaggc 

ggcggatttc 

ctacacagag 

aaacccaaca 

ataattttta 

ggtgtagctc 

ctgtatccca 

tcttcaggta 

actactgcct 

actgtctgca 

aacctggtat 

tttgtaaaca 

attaggaaaa 

aagagacagt 

tgcccttcaa 

ctatagctaa 

tccgtttata 

cgtaattctt 

agcaggctcc 

gtgagctctg 

tcttaacaaa 

tcagcatgtg 

tagacagttg

tgaaaacaag 

tggacacaat 

agtttaagtc 

gcaagcgtga 



cccagactcc 
gaagggactg 
gatcaaaaca 
accttaggaa 
gtgtctgcaa 
tcaactgttt 
tttctccttc 
tccatagcaa 
agagaaagca 
gcttcaaagg 
cttattttgc 
agtgctagtg 
aaggcaacag 
cgcgccaccc 
tgaagaatgt 
tttgatttta 
cctggagtgc 
aaacactaaa 
tccaactgta 
acatgcccat 
taattgtgtt 
attatgcttt 
ggcccaggct 
ctcagacacc 
tgagttgctc 
aagacacagt 
tgtggtattt 
accatcctga 
gggcttgaca 
tccaaattac 
tgaactggat 
gttggtagag 
aggtgaggac 
cagcccaacc 
gtgtaaccac 
aggctgagcc 
gacccccatc 
cgggtgtggt 
tgagttcgag 
aaaccctgtc 
catgtaccta 
gttgaacata 
atttagtaga 
gtgcttgggg 
atgaggagtc 
ggagtgctca 
caatgatgga 
tcctcagctc 
accaatcctg 
gctactaatc 
cccaaatcaa 
ttaacgcatt 
ggaaaaatct 
ataacctgtt 
aggctctgct 
gccctcggtg 
ggcagagaaa 
cacaaagaat 
ggaggcagag 
caagatcaac 
taagacgata 
taggctgggt 
tgagcaagat 
aggatttaaa 



cattctgtgg 
cttgctgctg 
ggccaattca 
aacccacctt 
agggctcact 
agaattgaca 
aagaacctgc 
caaatcgatg 
gggtggtaag 
aaaaggagac 
aactgacagc 
aactgtcaca 
ggagataaaa 
tgtcggagtg 
taacattgta 
cttctgtgat 
ttgtagatga 
actggtgcgc 
agttctaact 
ggaattcatt 
gctggagaca 
agttcatcct 
actgacaatg 
cagcccagtt 
agtagtacac 
ttaaaaggtg 
gctggtatgt 
gggaatcggg 
gatgccttta 
ctgaggtagg 
atggtcactc 
aaaacataaa 
ggactaggca 
tgagctctaa 
ggcaagcggc 
acaagactca 
tcaaagttga 
ggcgcatacc 
gccagcctgg 
tcgaaaaacc 
ttctaacata 
atacaatgat 
gtacctgcct 
aacaggcaga 
tgagccagcc 
ctctggacca 
gacagtacgt 
gttctggcta 
gcagcaataa 
cacttaatag 
atgatggcaa 
gtaacgcagc 
ttctaaagcc 
ttgggccacc 
ggctacacac 
acctgctcat 
aattcacaaa 
aaataaatag 
gcaggcagat 
aacgctatat 
tggatcaatg 
ataattccaa 
attatcgcca 
ggcacccatc 



117540 
117600 
117660 
117720 
117780 
117840 
117900 
117960 
118020 
118080 
118140 
118200 
118260 
118320 
118380 
118440 
118500 
118560 
118620 
118680 
118740 
118800 
118860 
118920 
118980 
119040 
119100 
119160 
119220 
119280 
119340 
119400 
119460 
119520 
119580 
119640 
119700 
119760 
119820 
119880 
119940 
120000 
120060 
120120 
120180 
120240 
120300 
120360 
120420 
120480 
120540 
120600 
120660 
120720 
120780 
120840 
120900 
120960 
121020 
121080 
121140 
121200 
121260 
121320 
cctacagtat

catttttgct 

aattcttact 

gcgctggact 

ctttgctgtg 

acaatactcc 

aaaaaagcac 

ctcacagaaa 

aacaagtcca 

gacagacctc 

taaagacaca 

gctggggtac 

cattcccagt 

gtgtacccag 

ttgggttcaa 

aaccacagag 

gggagggggg 

gtaaacaagt 

aattttttta 

ccatgtgggg 

taaagaaacc 

taggagccca 

aattaaaaca 

gttaagagca 

agctaacgcc 

gcccgcaaac 

gattctttca 

ctctgcagac 

ggaattaaag 

aaagcgtttt 

acataattaa 

gattcactca 

atgaagactt 

tctttgtgtc 

cttcctacct 

tggactacaa 

tataagtgtg 

caatattagg 

cttgaagcaa 

acatacacac 

gcggtgagga 

gacaagccag 

atgccaggga 

agaagctgta 

gatatgggtc 

agcacacacc 

tggtccacca 

gtggctgcaa 

agaactggga 

acagaaggag 

aaactggtgc 

aaaaagacta 

gaagcaagcc 

tgttaaaaaa 

aggaggcaga 

agtgggttcc 

agagaggaag 

cagcgttatt 

aagatgaaaa 

gctcattctt 

gactagagtg 

aactggagtt 

aggcctgaag 

ttacaaacag 



atgcagcatt 

caatatggca 

agcctgggct 

agcttccaga 

gtgaaaggtg 

tggcttggta 

agagctcaca 

tcaagaagaa 

cttcagaggc 

actcagaaca 

gaagatgggc 

gctgtaacaa 

taacccagca 

gaggtaattc 

attaggatca 

ttggttaaat 

ggagggaagg 

ttcaggctca 

aagaggacag 

tacaggagca 

ttaaactgtg 

cccattgcac 

aacattaatg 

cagcgcccat 

ctcttttggc 

attcaaagaa 

tttttttgag 

caggctggcc 

gtatttaaca 

gttgtttttt 

aaatgccaat 

cacaacttat 

ctggatacat 

agacattagt 

attgcagtgg 

ggcagcaagt 

tgtgggcact 

catcaagagc 

aagaaactca 

acacacacac 

actgtaaaaa 

ggcctctgcc 

ccactctaac 

gagagttgat 

agccatgctc 

ggacctggca 

tagtggctgc 

agaggacaaa 

agtacccgct 

agactggtta 

ccatcactac 

cagagtacgc 

agactggaga 

aaaaaaaaga 

ggcaggtgag 

aggaaagcca 

tggatccaag 

aaatgacagc 

aataaacaca 

gttatccaaa 

agacactgct 

tgagggaggg 

cccgaaggtt 

tttaaacaca 



gactgtctag 

gcagtaataa 

gtgtttgctc 

ggatgtaaat 

ctgtttctga 

acaatgccaa 

tctgagccaa 

aaacaagcaa 

caaaaaccaa 

caagctatag 

tgctggcatg 

gcagaggaga 

atcacttctg 

actgcggctt 

agcacacttt 

aaacaacaga 

gggagaagct 

gtcggggtat 

aaaaaagaca 

gcatttgcat 

ttatctataa 

gccatagttt 

cttgttaaag 

atggaggctc 

cttcaagagc 

aaatacaaac 

acagggtttc 

tccaacccag 

cacacattat 

gttaagtttc 

gtgaaataaa 

atacttaaat 

tagaaacgta 

gtgagtgtct 

ttctcgctaa 

cccagcaatc 

agccttgtta 

tcttaactgc 

cagatggtca 

tcacacacca 

taaggtgtca 

ctcatcagct 

atggagctgg 

ttgatggatt 

tagtgagtgg 

agtcaataaa 

tcgtggttgg 

agtctctgga 

gggacagaag 

tcggaggaaa 

aacagcagta 

accctggcta 

aaaggaaaca 

aagccagaca 

gcaggaggat 

ggcaacaaag 

agcaagagag 

agaaaagatg 

agttctaaat 

caaactaaag 

ttaaaaaagc 

agtgggcccg 

gaactgagaa 

ataccagcaa 



ttttatttcg 

aatgtatatt 

actaactcca 

ctaactttgg 

agccacgaca 

aaaataccaa 

aaaaaccgac 

acaaacagca 

aaggcaaagg 

ccatgatact 

actgggaaga 

atgtctaact 

gcaaaaaaag 

ctggtggatc 

tatagtcaga 

acagccacac 

agaatcccag 

aaggtaagac 

gcgtagagaa 

ttggttactt 

agggcagcag 

tgattacaaa 

aaacagggct 

tcagccatct 

actgcatgca 

tgctataaaa 

tctgtgtagc 

agatctgcct 

acatatcttc 

aattaaaaaa 

taatatatac 

acaattttca 

ccctgaaaat 

aaacttgcat 

acctggaacc 

ctcctgtctc 

catggctgct 

taagccatct 

gagtttacac 

aggcttagtg 

ggaaagctct 

cttagtaact 

gatgggctcg 

ctaaagtagg 

ccccacaccc 

acaaaaacaa 

gagctgtccc 

actgatcaac 

cagactctga 

tctgaaacat 

attgaattgg 

tcatcaaccc 

aaaagttatt 

tgggggtgca 

ccatgagttc 

aaaccctgtc 

agagagcaga 

ggcccgacca 

atcattttaa

caggggtggc 

aatagatgca 

ggaagcctgt 

catctcaaga 

taaattgtca 



cacactttga 

tgcattttga 

gtggacttct 

ttctcggcct 

gtcccatggt 

aaaaaccaaa 

actccctatt 

cataacaggc 

ggctcttaaa 

gctcttcatc 

gaccagtgtg 

gtgtctgcca 

cccactctgc 

ttttttcctt 

gcacggagac 

cagaactgct 

cattcaggat 

tttatttcac 

ctagctatct 

ggcgaatagc 

tgggtgcaca 

gtcacgcagc 

gcagagatga 

gcttctccaa 

catggtacgc 

cccatatatg 

cctggctgtt 

gcctctgcct 

cttcttcttc 

ctacatagtt 

atatataaca 

caataatgaa 

cgcaaatgac 

aaaggctctc 

tggggctcct 

acctttcttg 

gggactagaa 

ttctaccctg 

acacacacac 

accactgtga 

gcacaaaatg 

atggttgcct 

agaggccctg 

gggaatcgat 

acccatgagt 

aaacaaacaa 

gcacttatgg 

tctagagtct 

aggtgatcag 

aactcgacgc 

gcacaacatt 

aggcgattct 

taacaaaact 

cgtttttaac 

gaggccagcc 

tcaaaaatca 

gagagtgggg 

atgacatccc 

taatggctgt 

cgggatttac 

cgctaaccat 

gcctataaac 

caaagcacag 

gctttatgac 



atcatccatc 

tgcatggtgg 

gacgtaagag 

cccctgaagc 

ggtttgagta 

accaaaacca 

ttttgaagaa 

taacaacaac 

catttacaag 

catcaattct 

ctcactcatt 

atcacaggca 

ccgcacactg 

cttgcagtgc 

aaaaggatct 

ggggggcggg 

gcagaggttg 

aaaaataaat 

tttatgtaaa 

cctggtagga 

atgcagtggg 

tctactcaaa 

tggctcagtg 

ttccagggga 

ttacctacat 

accatcttaa 

ccagaacttg 

ccacagtgct 

ttcttttttt 

ttataggcaa 

ttctgtaata 

aagctttgaa 

ggttttcatt 

ttctctatca 

gttccttggc 

gaaccaatgt 

ctctggtctt 

attagaattt 

acacacacac 

aaagggaagt 

gtgtcctctg 

acagcaaacc 

ttattaaagg 

ttcctttatc 

atgtgtggac 

ataaacgttc 

caggaagaca 

gcttgttatg 

gacagagatc 

atactggtcc 

cagaaaacag 

ggcactattg 

tcccagagca 

cctagcactc 

tggtctgtac 

ctgactgggg 

gtttggatgt 

agaaatggca 

gtgtctgctg 

ggccagccag 

taatcagcgt 

ccaacctgcg 

gcacaatctc 

agataggctg 



121380 

121440 

121500 

121560 

121620 

121680 

121740 

121800 

121860 

121920 

121980 

122040 

122100 

122160 

122220 

122280 

122340 

122400 

122460 

122520 

122580 

122640 

122700 

122760 

122820 

122880 

122940 

123000 

123060 

123120 

123180 

123240 

123300 

123360 

123420 

123480 

123540 

123600 

123660 

123720 

123780 

123840 

123900 

123960 

124020 

124080 

124140 

124200 

124260 

124320 

124380 

124440 

124500 

124560 

124620 

124680 

124740 

124800 

124860 

124920 

124980 

125040 

125100 

125160 
acaggcatac

cgacacagct 

cacagcatgg 

tggaaaaggg 

agggcccagc 

gtacaactat 

gaagcacaaa 

agttgagaca 

ctctttacaa 

gccagaggcg 

ggtccaggga 

ctacaggagc 

tcagggggct 

gagttcagtt 

taaacatttc 

aaaaaatctc 

agcaaagcaa 

aaaccccagg 

accgaaacag 

accattcact 

cgatgaacag 

gatagctgcc 

gatccaaggt 

ggaggagggg 

tgctggcctc 

ctcacccagt 

acaaagaaag 

gatgtaggct 

tgctatcacc 

tcctcagcta 

cacatctgta 

acttaaaaat 

nnnnnnnnnn

nnnnnnnnnn 

tgtcatcttg 

attttattaa 

ctgtgccagg 

gtaacatggc 

gcttacttat 

agcgatggga 

ctaacctcag 

agcagagccc 

ggcgggaaga 

ttgtgtgtgt 

tgcatgtgtg 

catacaatgg 

gaacacattt 

caggaaatgg 

ctttcctggc 

caggcaccag 

aaataaaaac 

gtggatattt 

caaaaactgg 

ggtccaaagt 

tcttctggag 

aaaaaaaata 

agggaaagcc 

aggatcacag 

agaccccggt 

gccacaaaac 

tggtaatttt 

aactcactct 

tggattttga 

ccaaagaaaa 



cacaaagatc 

gagcctagtg 

ccagagtgac 

aagaaggcaa 

atccctctcc 

aaatactcct 

cataacactg 

tcaaacacac 

agtctccaca 

gcggcacaat 

gcactagtga 

ctagacaccc 

gatgagatgg 

cccagaaccc 

tatcctccaa 

tcgtggacag 

atccaaatat 

tcgctttagc 

aaaaggcatt 

cttaataaaa 

gaacacaaac 

agactgggct 

gctgcctccc 

tttgacacag 

aaaactcttc 

tttaatgcca 

atcacacaga 

ggtctcaaac 

catggtggca 

ccatgggtgt 

accccaccac 

aaataaaaac 

nnnnnnnnnn 

nnnnnnnnnn 

aagggttgaa 

aaaacaaaac 

gaacccactg 

tgcttacgca 

aatatcaaac 

aaaataacct 

ctgcagactt 

tgtgcctgtg 

gatggaaagt 

gtgtgtgtgt 

tgtgtgtgta 

ctatgcttat 

ctaaaatttg 

gtaggagagg 

accaactagg 

gcaggaaagc 

ggagcagaag 

tgtgagttcc 

gggctggtga 

tcaaatccca 

tgtctgaaga 

cctttaaaaa 

acactcgatg 

cgggacggag 

ctcaaaacaa 

accttcttct 

ttgtttgctt 

gtagaccagg 

tttgggggtg 

aacacccccc 



gggaagaaga aacgggattc 
acaggccgtg ataagctcaa 
agggtcacag caatacagca 
agagcaccac gagagaaaag 
cacctgacag agaaaaccag 
tcctagtgta gacagagttt 
taagttacca gaagtttaga 
gtttgatcct aggggaatta 
tttataaacc aaatacactt 
ggtggctgag gcagcaggat 
attcaagtaa acattaaaca 
tgtaaggtca gtactactaa 
ctcagtggct aagggcatgc 
gtatcaggca gttcgcccac 
gcacagacac acataaacat 
acagcaaaaa atactgaacg 
gcgaaaagga taatacaatg 
atccaaaaaa ttcaatcatc 
tgattattaa taaatacagg 
atgttaccaa acaggagaaa 
tggctgggca cagtggcaca 
ggcatgagaa ccaagggtaa 
ccccccccct gtgtcactta 
ggtctcactc tgtggcacag 
tgcctcagtt ccagagtgag 
tttaacagta caaggttaaa 
caaacatcac acaggtgctg 
ttgcacaaga aaagaatact 
tttcggaagc agaggcagga 
gaggccagcc aaggcagcac 
tcaggaggct gaagcaggaa 
ataaataagg ttggagagag 
nnnnnnnnnn nnnnnnnnnn 



nnggaaaaca 
atcactgagg 
tccttctgga 
ccagcagaca 
gacaccacac 
ataagtacca 
gcacatggtc 
gtgggtactg 
ccagggaacg 
ggagctaggg 
gtgtgtgtat 
tgtatgtatg 
gtgtctttta 
gaatacttgt 
caggagagat 
tcactcacaa 
ggccccacat 
ctgggcctgg 
gggtctacat 
gatggctcag 
gcaaccacat 
cagcaacagt 
aacaaaacaa 
gcgcaagcgt 
gccagcctca 
aacaaaacaa 
tcccaatcct 
ttcaaggcag 
ctggcctcta 
tgtgggtgat 
ccccaaagta 



cccattagac 
aatggctaat 
ggagtactgt 
cccaacttca 
acacttcctg 
tgtgagcgct 
agtaatgggc 
agaacggacg 
gaggacgctg 
cccgatgacg 
gtatgtgtat 
tgtgtgtatc 
ccacaagtaa 
tggggaagct 
ggcttagtgg 
ccaggatctg 
ccctgcaggt 
aagtccttta 
agtaaaaact 
tgggtaagag 
ggtggctcac 
gtacttacat 
aaactcccac 
gtgatcacag 
gctacacagg 
aaaaagcgtt 
tgaatgctat 
ggtttctctg 
actcacagag 
accacaggga 
tgaacatcaa 



ttgcacaaca 
aacatgcaca 
acagagacag 
ttaacccgcg 
gaggacgcag 
accaaagggt 
cacgttgaga 
aattattggt 
gtatcagaat 
caagtctgga 
ttaaataaga 
gataccaaaa 
actgctcttg 
ctgtaatgta 
aattaaaaat 
aaatctaatc 
caaatccaga 
ataattcacc 
gaaagcattt 
gaagtaaaat 
tcctgaaacc 
gaggcaccca 
aatccgtaat 
gctagcctga 
gggaaacaat 
cgctgcctga 
tttgagacta 
tttgccatca
agatcagtag 
gtgagccagc 
gttcaagcca 
ggnnnnnnnn 
nnnnnnnnnn 
tgctgaagag 
gacaaaactg 
gctcccttca 
aggataatca 
tatgctgggc 
acagggcctg 
aatccactcc 
tgactaagtc 
ctcaatgggc 
tgaatatggc 
gtggtgtgta 
caaaatggtt 
aaaattttaa 
atctgagccc 
ttaacacaca 
acagcctctc 
aaaacatttg 
cctccagcac 
tgtcgccaag 
cacccgactg 
aaccatccgc 
atattaataa 
taaaataatt 
tattctgtag 
ccggtctggg 
cacattatca 
cgaatttggt 
tgtagccctg 
atgtaattct 
cacttgggcc 
ctgtatataa 



ttttacaaat 

ctgcaaacac 

taagaaacat 

attccatatg 

ctgaaacact 

atcgtaatct 

atttaataga 

aacagaaaag 

tgttgtctga 

gaagcccctg 

aagacggatt 

ccaaactgac 

cagaggacct 

ccagagaccc 

aaatcttaaa 

aggtagactc 

cttatcccag 

gtattagcta 

gacaacattg 

ctcctcaacc 

ccagcaccca 

aacacttaat 

taaatttctg 

atggagtcaa 

catgagccac 

aaagaacaag 

ggtctctagt 

agatcaaggt 

tacaaggtca 

tgtggaagca 

cacaagagcc 

nnnnnnnnnn 

nnnnnnnnnn 

actgagaaat 

cttaaggaga 

gcattaacta 

agcttgctac 

atcacctcta 

gccaatgaac 

ccagacactt 

aattcacaga 

agagcttcag 

taatgccata 

tgtatgtgtg 

ataaaaatcc 

gtaatcttag 

cagcatgaat 

catggtggcc 

ctggcctcct 

tactgaaact 

cgggaagcag 

taaaacaaaa 

ctcttccaaa 

aacaagatcc 

ataaatcttt 

ggaaaagtcc 

gtaaggcaag 

ctacagtgtg 

tatactcaag 

tacttttttt 

gctatcctgg 

ttttcaaagc 

ccaaaccaaa 

aactaacagt 



125220 

125280 

125340 

125400 

125460 

125520 

125580 

125640 

125700 

125760 

125820 

125880 

125940 

126000 

126060 

126120 

126180 

126240 

126300 

126360 

126420 

126480 

126540 

126600 

126660 

126720 

126780 

126840 

126900 

126960 

127020 

127080 

127140 

127200 

127260 

127320 

127380 

127440 

127500 

127560 

127620 

127680 

127740 

127800 

127860 

127920 

127980 

128040 

128100 

128160 

128220 

128280 

128340 

128400 

128460 

128520 

128580 

128640 

128700 

128760 

128820 

128880 

128940 

129000 
tcatataagc
tctttcagta 
atgtaaacga 
ggtcagggaa 
cgtcctcacc 
tgcccaagcc 
actttaaaag 
aagaccttgt 
acaatggcaa 
agggaaggac 
actagtctat 
ctctgcatta 
tgttcatttg 
aaccgcatgg 
ggaactcggg 
cccaattcac 
caggctagct 
gtttccaccc 
ttcacaccaa 
atatgcactt 
tttactaggc 
accagctctg 
tctgtgcaat 
ggtaagtcct 
tggcataagg 
tccccaaggt 
acccactggt 
agcgaataaa 
taaagtaact 
taacggttcc 
acagtatcta 
taggtgattc 
ggccgtgcta 
gcttaaaatg 
ccaagtaaac 
ggcttcgcac 
cctcccttat 
aatctgcccg 
ttaccccgta 
gctgcccagc 
ggcctatctg 
acaccgtagg 
tgggaaattc 
tcttattgca 
ccaccgctgc 
aagcaacaca 
gtgtacatcc 
aaatcacacg 
ctttttccat 
ctctgtctct 
tctccggaac 
cctcccaagc 
tatagttatt 
gtgtgcagtg 
cctactgtgg 
tgagttatct 
tcatataaac 
cccctcacca 
tactcactga 
tctttaggga 
agccattcag 
agacaacctg 
ctcacaggct 
atgactaaga 



taagtagcct 
gatgccattt 
aaatatgtta 
gcacggctgc 
agcaccaaga 
tgaaaatctg 
tcaattccag 
gggtgattta 
aagctcctcc 
actatacaca 
tttttttttt 
aataaaccta 
ttttgactgt 
agcaggtctt 
tggtcagatt 
ctgtttactc 
ttgaactcac 
aagtgctggg 
agtggcttcc 
gcgtttctct 
tgaagcacac 
atgcaaaaac 
ggcattccct 
cactcagtgc 
ccagcattat 
acatagtgtt 
tctcttgaaa 
taacaaaagg 
tccaagggag 
cccttatgag 
atgaattaca 
tgaccaagcg 
gccctgctac 
tggtcgaact 
ttccgcagat 
ggcaagtttt 
caatatgctt 
catacttact 
acaagtgctg 
agggcaaagt 
atttcctcag 
cttctcacaa 
ccggtcacac 
gtcctgggtt 
cacagtgctg 
ggtctggaac 
tagctcccac 
tgctacacgc 
tttggagacg 
atagagtgcc 
tcactctgta 
gctaggatta 
atttgttaca 
tgcacatgtg 
cgtttgggga 
cgacactgac 
cacagcgtgc 
tgtgagtgag 
gttcccttgc 
tcagtaagtt 
tcagtctcta 
aagacggaag 
tgacaggcaa 
tcttctaaat 



gcaggtacat 
gagaaaaaaa 
aatcttaaaa 
aggcccccag 
acagccaact 
atatgaaaac 
ctatctgtgt 
aacagcgtgg 
tccgtggcct 
ggtaccaatc 
accttagcta 
gtgtataaga 
gtggtgtgca 
ctccttccac 
tgcttggcaa 
taatgtatat 
tgtggacact 
acaacagaca 
tatgaccttc 
ttttttactt 
ttcaggagac 
aaaaaacagc 
gacgcattac 
gcgcatccac 
cacagcgcgg 
ctgtccaagg 
atctgcaata 
aagacacccc 
aaatgacatt 
gaagtattca 
ggagccactc 
taggctatgc 
atgcaggtaa 
caatctgcta 
gtcggagcat 
taatgagcct 
gacatctttt 
gggcagccac 
gggttacaga 
cttccttatt 
cacaccccca 
tccacactac 
ttctcacagc 
gggcaggcct 
aaatagtccg 
aggcaactca 
gctcacgagc 
agaacccaaa 
gcggctcact 
tggctaccac 
gaccaggctg 
aaggcatgtg 
tatgtgtatc 
tgcaggtcag 
actcaggttc 
atgtaatgat 
atgtgcaggc 
ggagaggaac 
tggccaagca 
tttccccaag 
ctgccactcc 
gtgagccaat 
ttcttggact 
ggaaaaatgt 



ggttatggaa 
aaaaaacaaa 
aaccttaaag 
agcagcacat 
ataataagct 
agagggcaag 
ggaaggactt 
agggcagaaa 
ctacacgcat 
cccacgaact 
gtgtctttct 
aggcaaatga 
tgccaggcgc 
caccccaggg 
gggcttcact 
atttttgaga 
agctggcctg 
gcaaccatca 
tcctgacaac 
caagctttga 
accctgtagc 
gatggggatg 
ctcacaccag 
cacacccaaa 
acaaacacaa 
atggcctctg 
tttgcctaaa 
aaggacgagg 
gccgccacac 
cttctggtca 
aataccttta 
aaatgctctg 
tgctacgcca 
gaggtgtgag 
cagacagagc 
ccatggtaat 
agttttttga 
agctggccct 
catatgccgc 
tacaaagcag 
agggtctcac 
tcaccgataa 
ggtacaaccc 
ggtacataca 
cctgcaggcg 
aagctgctct 
gatatgcaga 
gaagtaacaa 
agctagcctt 
acctgggcac 
gcctcaaact 
ccatcagcgc 
agtgtgtatg 
aggacaactc 
caagatagca 
tcagtgtgca 
taggaaacaa 
tcaggctgtc 
ttatttataa 
acaagccaga 
tcaacctgtc 
aaaatcttac 
gaatgcgttt 
ctggttaagt 



acagctttat 
acattctttg 
cagacgccac 
tccctgaggg 
cataaaaaat 
acagataaaa 
ctcgggacgg 
aagaaaggaa 
ccaagcgata 
agaaaacaca 
tatgtttggc 
gtaaaggtaa 
attcgtggag 
gtcgcagggg 
cactgaggcc 
cagggtcttg 
gaggtctgag 
aggcaaaatg 
cccaaacagc 
ctccactttc 
tctggagtga 
gggcaaaggt 
atacatacag 
gcaagcattc 
agccaacctt 
cagactcctg 
aaacacaact 
gaagattatc 
acagcccccg 
ccccgtgttt 
tcataaagtg 
agcacactta 
gtcttacgcc 
tttagacctc 
tgtcccctcc 
agtgggttcc 
gacagcaaat 
gaactcctcc 
catgttcagt 
cagccaagcc 
cgatggcagt 
gatcatgcgg 
attctgctgg 
gttctctttg 
aaaggaagac 
tccttgcaag 
atcactaact 
accggcactc 
gaactcagag 
tttgttttcg 
cagaaatcca 
ctgacccaat 
acgtccatat 
tcaggagtca 
ggaaaagtgc 
cacagtggtt 
cctgtgggag 
aggctcggtg 
gatggtgatg 
aaaacctttt 
tgtggggcag 
tgacagtaac 
aagagaatgc 
aacccagcaa 



catacagact 
gattttcaac 
tgctctttgg 
aagccttgtg 
cctgaaaggc 
ggcaaactat 
actatctgct 
agtgcaggaa 
tggagggagg 
gcttttacta 
ataatttctg 
agtcagttaa 
gtcagaggac 
tcccagggat 
ccccaagggt 
ctatgtagtc 
tgaacctcct 
aagtcttccc 
aagtgcccgg 
ctaaaaggtg 
ccggaaacac 
ggcttgtgct 
aaaagacaaa 
aacagctcag 
ctctttgagt 
gaaacacgtc 
ctattatttc 
ttttcctact 
actctttggt 
cacttttagt 
ggctttgtgt 
ggggaggctg 
acaggaagaa 
gggagacggg 
tgatgcaaaa 
tctctctcct 
aacgtagccc 
tgcgtctcct 
gtaagtgacc 
ctgcagccca 
cagtccatga 
tatttgaaat 
tcaatcactt 
cggagaaaca 
cgccatcagc 
tggtgagcgc 
ctggtttcag 
acgcacctca 
tttggtctgc 
agacagggtt 
cctgcctctg 
tctttttatt 
gtatatgact 
gttctctcct 
ctttaacagc 
ttgcatgtag 
ttggtgatct 
gcagtgcctt 
tctaccctta 
aggcccaatg 
gacaagccac 
agccagccag 
agaattaccc 
ggagctaagt 



129060 
129120 
129180 
129240 
129300 
129360 
129420 
129480 
129540 
129600 
129660 
129720 
129780 
129840 
129900 
129960 
130020 
130080 
130140 
130200 
130260 
130320 
130380 
130440 
130500 
130560 
130620 
130680 
130740 
130800 
130860 
130920 
130980 
131040 
131100 
131160 
131220 
131280 
131340 
131400 
131460 
131520 
131580 
131640 
131700 
131760 
131820 
131880 
131940 
132000 
132060 
132120 
132180 
132240 
132300 
132360 
132420 
132480 
132540 
132600 
132660 
132720 
132780 
132840 
cacgcaagcg
aatcatgtca 
cctgcgcccg
tatacagacc 
aggcatttgc 
catttaaaga 
agtggaagag 
cgaatacata 
agctggttgc 
ccaggggatc 
tgcaggatgc 
gtggatttga 
tggaacatgc 
gaaatagctt 
tcaaagccca 
tccgttgtct 
gatgtttaat 
ggatcgagca 
aataatggag 
tccaagagaa 
cagtggttaa 
gtggtggctc 
ggctacagtg 
aggaagagga 
gccagtctgt 
gctacattat 
caggcctgag 
aactcaagcc 
cctggcccag 
cggcccctcc 
caaacctagg 
tgtgctgtga 
cacctagtcc 
ccctcctcct 
aactgggctc 
cccctccccc 
aagggacatt 
gcccaatcag 
ctcagtgaga 
taaaattgac 
gaaggtagca 
ctacacgatt 
ccaacagagt 
aatccgtctc 
gacggctcat 
ccactcttag 
gaacttgtag 
tgtgatcaca 
ggggttgtgc 
gtctctggtc 
agacccacac 
taaatattgt 
ctgtttttct 
ggtactacag 
ctcaatcctt 
cttaaaaagc 
tgggaatgac 
cagtgacaac 
attctacccc 
cagcgcgcag 
taattgcttt 
ggctttcagg 
ctcaatagca 
ctctggttta 



gtggatacct 
agtacctaca 
gacccccacc 
agctgttatc 
cactatgcct 
gacgaagcta 
cttgcttagc 
agaaggctta 
ttaatttccc 
tgatgtcctc 
ctcgggccac 
gtttggtccc 
atgctctacc 
ttaggaggat 
atttaaaagt 
gtctttagtt 
tctcaagatg 
cacagtgaaa 
cagtggcgag 
cgcaggttag 
gagcactgac 
acaaccatcc 
tacttacata 
ccactgagca 
ggcatggcag 
gtccctctac 
gtcccaggta 
cctgccgttc 
gttctcagat 
actcctgtgg 
ggagaaagaa 
atgcaccagc 
actcaagaaa 
ccacccgcaa 
acattgacag 
caccctcggc 
tttgtttcat 
ttatagaact 
atttgtttat 
tatacttctt 
aggacaggct 
ggtggtaact 
agcggtggcc 
accactacta 
caggagtgaa 
agccaagatc 
atcaatgaca 
gttactgaca 
tcagagtctg 
agacttctgt 
cactgaggat 
tattggttta 
tgagtctctt 
ctattcaatt 
ctacacaaca
tagatatggt 
aatgtacagg 
acaacagaag 
cacatcagct 
ctgttaaagc 
ctctggctct 
ccaccatctc 
aaggcttggc 
ccagagcccg 



gctttgcctc 
aaatccagat 
cggggtctca 
aaattcagag 
ggctctcact 
gaaaccagtt 
atgcacaagc 
gaggaggggg 
agtccccaca 
ttctgacctc 
tatgctggct 
tgggacccac 
cgcaaattaa 
gcctaaggaa 
ttgagtgctg 
taagagagca 
tcctgtgata 
tgagacagtg 
cgatattttg 
accaaggtag 
tgctcttctg 
gtaataatat 
taataaataa 
cacattgaaa 
caggaattac 
actggccgtg 
aggacacagc 
tctgggtgtc 
ttagcctcat 
tcagagatgg 
acaaacgtac 
cccgagctgc 
tgcaatgccg 
ggtcagacaa 
cagccatgaa 
ctcaagcctt 
gagtctgcca 
caatcagaag 
ggaaatatca 
gaccatctat 
agtggcagag 
aaagctgccc 
caaggcaagc 
gccgctcaag 
cagagccagg 
taaagatcgc 
gtattttgtt 
gtgaaatatg 
ggagaaaaac 
cctacaatgc 
gccttaagtg 
catcagactc 
aaattctacg 
tccttaagct 
ccctcaggat 
gtggaggttc 
caacaaagag 
atgtgaccag 
gcccacagtg 
cttgccgtac 
ctatgttcat 
agaagacata 
aaaacagctt 
agcacagata 



tgaaccctgc 
ctagcctgaa 
ctgtgtagtc 
acccacttgc 
ggttgtacca 
tagacagctt 
tgttggtttt 
tggtggtgga 
tggtggctca 
cttggggaca 
caggggtgaa 
aaggaagagt 
aaacttaaaa 
cttctctgcg 
tcacgtgttc 
actgggtgag 
gcattataag 
aggaagaaaa 
ggaatacaat 
gaagaggatc 
aaggtcatga 
ctgatgccct 
ataaatcttt 
tggaagatgc 
agtctgttca 
actgaaaagc 
cctgacctgg 
tggagcgagg 
gcaaagtttt 
aaacaccatc 
tgctgacagg 
gccccctcgg 
ttgctccttc 
actgccaaag 
caaaggcagc 
caccccaatc 
aaaatgtttt 
gcattatctg 
aggttaaatt 
tggctaatct 
gacagtgagc 
ccacacagga 
agtcacatcc 
tttaacttca 
agacaggcag 
taatccttgc 
cgggggacag 
aacaccttaa 
caaccaacaa 
aacgctagct 
acagacaccc 
aaaacaaaaa 
acacagctgg 
tgggctaggt 
gaagtgctct 
taacaagctc 
tccattgttc 
ctccagagac 
caaactgaag 
ctggcgggat 
caggggatcg 
atgcctcgaa 
ggcagcagtc 
acccctgagc 



acaggttttg 
ttcaaagata 
ctggctgtcg 
ctcccaagtg 
gaggagcaaa 
tggggctgtg 
aaccctcagc 
gagatgggtt 
caacatccat 
tgcgactcat 
ggtgcttgct 
ttgacttcta 
tttaaagagg 
ttttcaggtg 
tgtatgccca 
aagtaactga 
ttgctgtgga 
cctatattgt 
agaaataaat 
actgggctgg 
gttcaaatcc 
cttctggagt 
aattaaaaaa 
aactgaaatg 
aaacccagca 
agcaacgtgg 
cactaggaaa 
ggtgcgaggt 
ccagttggtg 
taaggcttgt 
aacttcacat 
ctctcaccta 
catctcgtgt 
taccacagac 
aacagcaagc 
accaaaatga 
catgggagaa 
ctgtgttaga 
catttatatg 
gaaggaaaag 
ctgagatgaa 
ctgactccct 
gtcagaggac 
cacaggcatc 
cctctcttat 
ctgggtaggg 
acatgcctgt 
tgtagacaca 
accagaccaa 
atcagcacag 
cagatggatg 
tgaggagcgt 
aaaccacaca 
aaagtatttt 
tccaaggcag 
tcctcatctc 
tcactgtact 
tcagcagagt 
atgctctcag 
caaggtcagc 
tgggtttgtt 
gaagtatagg 
cacatgctga 
tgagttgcac 



gtttatgttc 
ggctacctac 
tagcgctctc 
ttgggattaa 
acaatgtggc 
ggtgtagctc 
gtgacagaac 
agaggttaag 
aactgcagtt 
ttggctcaca 
gccaagctgg 
tacactgagg 
aagctgtaga 
agattcagac 
gttctggctg 
gagtctagcc 
tgacagtgat 
atacgaggac 
gattcaaata 
agagatggct 
cagcaaccac 
gtctgaagac 
aaaaaaaaaa 
caataccaga 
cggacctgag 
taccctggag 
caggtgtaaa 
accttgtctc 
ttgccccctc 
ccttctgagt 
ccttctcaaa 
ggtctcagca 
ccttcaatgc 
cctttcccta 
cacactggca 
accagatgaa 
aatcttttaa 
gatttgcatt 
aaattattat 
aggccaggga 
aacacctcaa 
aaggagactc 
aaggtccata 
ggcaaccaaa 
agtaaaaggt 
ggggtatact
ttgtgatttc 
caattaacct 
tgccatggct 
ttacacagcc 
ctctaagtag 
gaaatatgag 
tgcccaccct 
ctttgaccac 
agtttaatgc 
ctgagtgctt 
ttaccgagtc 
aagaaagtac 
cactgttctt 
acttagattc 
ggtttaagag 
aaccactcca 
cagtgtccag 
aaaacctacc 



132900 
132960 
133020 
133080 
133140 
133200 
133260 
133320 
133380 
133440 
133500 
133560 
133620 
133680 
133740 
133800 
133860 
133920 
133980 
134040 
134100 
134160 
134220 
134280 
134340 
134400 
134460 
134520 
134580 
134640 
134700 
134760 
134820 
134880 
134940 
135000 
135060 
135120 
135180 
135240 
135300 
135360 
135420 
135480 
135540 
135600 
135660 
135720 
135780 
135840 
135900 
135960 
136020 
136080 
136140 
136200 
136260 
136320 
136380 
136440 
136500 
136560 
136620 
136680 
agccacgaag
ggaccgagag 
gctgccgaga 
aagaagaatg 
acagacctcc 
cacacagcga 
ccctcaggac 
accttccctg 
ttcactgtga 
aagaacactt 
gcaagcacgc 
ctcaaccctg 
tgtgaaatca 
ctcattattg 
cttcagggcg 
ctcagtccat 
agttcagcct 
gaaagaactt 
agggagagca 
ttttgacagt 
gctgcagaca 
aaacaaaagc 
gcctcgagat 
atcccggaac 
cacacgtgtg 
aggaatagct 
gaagcccaca 
cctaagtgac 
gctttcaagc 
aacttaacta 
tgaatctgcc 
ggggagagag 
tgttttacca 
gaggtactgg 
agaattgagc 
ctggcccctg 
ccgcagacta 
ctcggcaggt 
agtctctgca 
tttaggagca 
cagaaggaaa 
nnnnnnnnnn 
nnnnnnnnnn 
atctctatat 
aatcccattt 
tctagaagag 
ggcagcttat 
ataagcgcag 
atggttgtga 
tgccttccac 
cttagtactt 
ttctatcatt 
gcgtgaagta 
cccctgtgtc 
tgtgtgcaca 
ccttgaaata 
gatgagatgt 
cccctcttct 
gcagtgctgt 
aggcaaagtg 
tttgtttgtt 
tagaattcta 
gcaagattct 
caggtattct 



cttataggcc 
gctccgtcca 
gccgtcaaac 
tacaggatgg 
cactcgggaa 
aggtcttcaa 
tcggctcagt 
acatgtccat 
ttttaatcga 
tgtaagagct 
tagtcacgca 
gacgcaccta 
aactgaactg 
aaggtcatct 
ccctttgaga 
caaagcccgc 
gtcagctctg 
gtccacttgc 
cattagaact 
tagacaccat 
ttgcagaatc 
agctcagcca 
gatctggaag 
cctcgggatg 
catgcataca 
taatattccg 
ggaaggcgtg 
ggcttacttc 
tgactgcact 
aatggctaaa 
atcacagtta 
ccccaaagct 
gcatgtatgt 
atcactgaca 
ccaagtcctc 
cattttgctt 
gttaaagctg 
atcaccccag 
gacggagact 
ctttatctca 
gccacaatga 
nnnnnnnnnn 
nnnnnnnnnn 
atgaatgagt 
acaggtggtg 
caatcagtgc 
ttttaattgt 
gcactcatag 
gccatctgat 
ctagaattac 
tcaacatagt 
ctctacaccc 
tgagttagag 
tctgagacag 
gcgctctcaa 
acagatttct 
ttttgaagga 
acagagaaga 
actctaaggt 
cagactcata 
tgtttgtttc 
ttgttgatta 
tcccatattc 
cactgagctg 



tctgggatgt 
ccgaagtcag 
ttgtcagcct 
ttttatggag 
ggagcattgc 
atacttactg 
gcaaggactc 
caaatagata 
atcttcatag 
cccatcaagg 
tcaccccaaa 
ctagtgcgtc 
tgggtaagtg 
catagtcatt 
agtaatacac 
cccacaatgc 
actcaacacc 
tcattgaaag 
gcgctgctag 
cctcggagca 
ccacatatac 
cccgagccta 
acaaaggtgt 
gagaagacaa 
cacgaaaata 
ataatgaaac 
ctgaactcca 
acagtcagga 
gttacttttc 
accttaccca 
agatatctgc 
ttgcccttcc 
gtgagcatca 
cgggagtcac 
tcaaagaaca 
taagtaaagc 
agtagctgct 
ggcttcaact 
atttacagtc 
tgtctgaccc 
ccgccatgtt 
nnnnnnnnnn 
nnnnnnnnnn 
atcctgtagt 
gtgagccatc 
tcttaacccc 
tttaaatcac 
aggtcagaga 
atgagcacca 
agtatatgca 
gtaaatcgga 
aggcttcatt 
tcagacaatc 
gggaggattc 
ccatttgttg 
ccattttgtt 
actttctagt 
aggaaacagg 
aacgtgcgtc 
accagtccat 
ttcttcttta 
tattcacaaa 
cttcagaccc 
tacagcattt 



caggattcac 
agactcgggc 
gcaaatcaaa 
aagagtcagt 
actacaagaa 
gatagcacaa 
acatcttctc 
tttctcttag 
ctcttgcaga 
ctcctcttta 
cactggagac 
tttagccttg 
atggccatct 
tttaacagcg 
ctgcaaaaga 
agccttccaa 
gtcttcttac 
cagtagctct 
gccctcgctc 
acagtccaag 
agttacattt 
cagttgtaag 
ttcctgtgcc 
tcgccccacc 
attcttacat 
ctaatattct 
atgattcagc 
gctggtgcca 
tgtcaggcaa 
gacaagctgc 
tgtgctacac 
cggtatcatg 
ctgagtgtct 
agatggttat 
gcgagtgctc 
ctagtgagta 
gctccttctg 
gtgcccctag 
caccaatctg 
tgtgccccag 
cctcggagct 
nnnnnnnnnn 
nntttttatt 
tatcttcaga 
atgtaattgc 
tgagccccct 
gtatatgttt 
tgtaagatgc 
agtggacttg 
attagctgct 
gagcactcct 
gggacgtgtg 
agtgctctct 
tcctgttgga 
ccatttcaaa 
tgttcccttc 
agttactcgg 
aaattccaag 
caaccagcag 
tgttttgctt 
ttagtcggat 
caaaagactg 
atagcagcag 
gtctgaagaa 



aatgacagtg 
tcctttgatg 
cgcacgcaca 
caatgtctgt 
gctgcaataa 
ctccctgagc 
cccacagagt
caacttctct 
ttccaatgat 
aaacacggca 
aaaaatcacc 
ctctctaggt 
tcacagggaa 
gagaactgtt 
ggccaagtca 
actgtgctca 
cttgttcact 
gatttcaccc 
tcaacagcga 
tgtaaagaga 
tcaagctatc 
ttaacacata 
aacctggcca 
aagttattct 
cttatcaaaa 
gggtcccgag 
actcttcctc 
aaaatgttcc 
tatgttacag 
aggaaattag 
cgtgacaaga 
gcttatttta 
gatgcctgag 
gtgctactct 
ttaaatgctg 
taaatgatca 
ctggaggcaa 
caatgtaatt 
tattgctaca 
atagcatcaa 
gagccccacc 
nnnnnnnnnn 
aattatgtat 
cacaccagaa 
tgggaattaa 
cccttttttt 
ctgtatgtag 
ctttggagct 
ggtcctctgg 
cagccacccc 
tggtgcaaag 
atgccagcga 
gctctgagtg 
tcacatagtg 
tacagtgact 
tttcgtcaca 
tggaaaagga 
aaaaggagta 
ttgaacaagg 
tggttagttg 
tggatttggg 
tttatatgtc 
caagcttagt 
ctgatgaaca 



ctggatgaga 
gccatcacgg 
catcactgac 
ggactctaag
ccgatcatct 
taaagcccca 
tgtggtcacc 
gttgttcgac 
gtctgaaaca 
gggacgaaag 
actctttgcc 
caccgatccc 
ggacagaagt 
tgttagcgat 
gggcaagcta 
tggctctcac 
tcaataaggg 
tgcagaagca 
gagcaggcca 
ccctcacagg 
agctctggta 
aaacgaggga 
ccctagttta 
ccagcctttg 
catttttaga 
tgatgggctg 
acgcccgcaa 
aacaagtcat 
atggataaga 
tttctgatac 
ggaagaagag 
cgtgtgtggg 
gtgaccagga 
gttggtgctg 
agcagtctct 
gaagccctcc 
acccgccctg 
agagcgctgc 
gacgcgtccc 
aggtctccaa 
cgnnnnnnnn 
nnnnnnnnnn 
agatatagat 
gagggcatca 
actcaggacc 
tttttttttt 
ctatgttcac 
ggacttaaac 
accgccttgc 
ctcagtcctc 
attctgtgtg 
tgattcattg 
ggtctccctg 
catagagccc 
tcagagtctt 
aagacctgag 
caatgatgct 
cacagatgct 
cggaagatag 
ggttggttgg 
aaaaatgtag 
ctgaattttg 
ggccctttgc 
ttttagctta 



136740 
136800 
136860 
136920 
136980 
137040 
137100 
137160 
137220 
137280 
137340 
137400 
137460 
137520 
137580 
137640 
137700 
137760 
137820 
137880 
137940 
138000 
138060 
138120 
138180 
138240 
138300 
138360 
138420 
138480 
138540 
138600 
138660 
138720 
138780 
138840 
138900 
138960 
139020 
139080 
139140 
139200 
139260 
139320 
139380 
139440 
139500 
139560 
139620 
139680 
139740 
139800 
139860 
139920 
139980 
140040 
140100 
140160 
140220 
140280 
140340 
140400 
140460 
140520 
tgctcctatg
aatgcctctg 
caggtgaatc 
gatatgtagt 
aaaaagaaag 
agaaagaaga 
tgttaacata 
tttaaattat 
tgctggccac 
cagatgtgag 
ctcttcgcct 
gtcgttcttc 
gaactgcatg 
tagagtcaca 
cagtgttctg 
ctgagtcatc 
caccacgctg 
ctgagtcatc 
ctgtgttatt 
cagagatctg 
attaaaagaa 
gggaagaagg 
acgtgagctc 
gctccacgct 
ggttcagagt 
actttccgaa 
tgtaatggtg 
cccatggaaa 
ctcaggctca 
ctgcagggaa 
catacacata 
catcagtcta 
atggagaggg 
tgcaatttta 
gtaaaagtaa 
tcataaccag 
tgtgagaatg 
ccactctttc 
caagtaattt 
agatctaata 
aatattttta 
gggtcaagcc 
tgaaagaatg 
tttgttagta 
aagagttgca 
cagccccata 
aacagcctag 
tgactttggc 
atccttgggt 
caagtgggcc 
tgagttcagt 
tctgacctca 
cactcgggag 
atttacagtt 
tcagtggtag 
agagagagag 
aaggaaggaa 
gaaagaaaga 
gaaagacaaa 
gagggcatag 
cctagcaccc 
actactgtgt 
agtaatctgt 
ctagcctaga 



ttagtaaata 
gctgagtttg 
tctgagttca 
aagactctgt 
aaagaaagaa 
aaatgagtgg 
ctactattaa 
tattttatat 
cacacataca 
acacgcctct 
ctacaatcaa 
ctggccagtt 
tgcacttggt 
ggttatgtat 
gatttaactg 
tctccagccc 
gatttaactg 
tctccagccc 
ctaactcttg 
tatgggctta 
tagagtgagc 
tcaggagttg 
tatgagaccc 
accacttgcc 
attgtgcaca 
aaatattaaa 
taagaaagta 
gttgcttaat 
cgagtgctta 
tccaaaaccc 
attttaaaaa 
ggctacacaa 
agatgcactt 
tagagagctg 
taaatgttac 
tgaagtaccg 
tgtttttctt 
ccttcacgat 
tccatagcta 
ttggcataat 
tgtacagcca 
tctgagtgac 
tgggaatgct 
ttatacattt 
aaaatatatg 
atctaccttg 
tatggtggcg 
tccgaaagcc 
acgtcactca 
aatgagatgg 
cttcagattc 
gtacacatgt 
gcagaggcag 
atataaagaa 
agcgcttgcc 
agagagggag 
ggaaggaagg 
aagagagaaa 
gcaaagcaaa 
atcagtgtta 
taccgcaaaa 
actttcacta 
tcttggtttt 
acacacagag 



gctaggtttt 
gtggcacttg 
aggcagcctg 
ctcaataaag 
agaaagaaag 
atttgccata 
caacaacaaa 
gtatggtgtg 
taggcagtga 
cctaggccta 
gctctcccaa 
gagcgcgcac 
acccagggag 
gggttctagg 
ctgagtcatc 
caccaagctg 
ctgagtcatc 
caccacgctg 
cttccttgga 
aaacaagaat 
ctaggagttt 
aaggctagtc 
tgtctaggag 
taacacgcat 
ccctgtatct 
gctacatcat 
tttggcttca 
gaaaacaggt 
ctgcccttgc 
tcttttggcc 
taaaataaat 
gaccttgtct 
cctattccat 
ctattttctt 
tttggaaaat 
ttaacatttc 
ttacaaaatt 
ttattgtacc 
actctgtgtt 
atttggcatt 
gtgttttgtt 
gctgaattgt 
catttcttaa 
atttaggggg 
gtgtggtttc 
ttaaatatac 
ttaacctcgg 
ttcctggaat 
tgtgcacaca 
ctccatgtat 
tgcaagataa 
catggtacgt 
gaggatctct 
actctgtttt 
tagcaagcgc 
agggaaaggg 
aaggaaggaa 
gaaagaaagg 
actaaataaa 
taatgcttac 
tatttgctct 
gcaacataat 
atttttcttt 
aactctcttt 



tagctagcca 
cctttagttc 
gtctacataa 
taaacagcca 
aaagaaagaa 
gcctagtcaa 
tactttacac 
ggaagctgtt 
aggttctttt 
tgtaagcagc 
taaacgtgtg 
aagagtatgg 
cccagaagag 
aatcaaaccc 
tctccagccc 
gatttaactg 
tctccagccc 
gatttggaag 
accctgagaa 
atgtccaact 
ggcacactcc 
agtcttagct 
aggagaaaga 
gacgcccaag 
ttaaaagtca 
ccactctaca 
gttttatggc 
aaaaataaga 
agagaccatg 
tctgcaggca 
cttttaaaat 
caagaagaaa 
gaagctgcta 
tcttttaaag 
atagagaagt 
tgcttctatc 
gggataatgc 
cattttctca 
ctgttacaga 
atgatgctat 
tgggatgaat 
tctcaggaaa 
gcaacactgg 
ataggaaata 
ttttcctgtc 
tgtatataag 
gttacttgag 
tccagctcca 
tacacatttg 
aaaggcagtt 
caggagagga 
gtgtagtatg 
gagttttagg 
gaaaaacaaa 
aaggccctgg 
agagggagag 
gaaagaaaga 
gaaagaaaga 
atacatacaa 
ctagcatgcg 
gtcttgctaa 
gagaatgttc 
gccaaattga 
atgaactcaa 



ccttgctagc 
cagcactcag 
caagttccag 
gaatgactta 
agaaagaaag 
actagatttt 
taaccatgac 
gcagagtggc 
gccaagacaa 
accagttctg 
cagaaggatc 
gttttcacct 
ggcatcatat 
atgtcctctg 
caccatgatg 
ctgagtcatc 
caccacgctg 
cagaactgag 
aattccttct 
cttgatctct 
tatgttccaa 
ttgtgacaag 
aaggggtaaa 
gtttccgttc 
atagtcaacc 
aacttgtaaa 
ctttgcatcc 
caggagggac 
gcaggcagtg 
accacattaa 
gagctctaga 
gaaatgaagg 
ttttggtgat 
taatgcttgt 
ataaagagta 
cggccagcca 
tgcacttact 
cagtattaaa 
gaagaaatgt 
aataagcatc 
tagcacaaga 
ggactagtta 
ttattattac 
tgctcatata 
ccctagttta 
gcatatgtag 
aactgctctt 
agggatgcag 
gtttttaatc 
atgcaaaggt 
ccaacccctg 
cacatgcaca 
ccagcctggt 
acaggggttg 
gttcagtcct 
ggaggagagg 
aagaaagaaa 
aaggggaagg 
tatattaaat 
taaaactctt 
attatattgc 
actatcccac 
tgggtgaaca 
gtttcttatc 



ttacaactag 
gaggcagaag 
cctggccagg 
aatattgcta 
aaagaaagaa 
cttggttata 
aagctatatt 
agttggctac 
gttaaccaat 
ggctcagggt 
ctgttgcagc 
acatttatgt 
cccctggaac 
gaagaacagc 
gatttaactg 
tctccagccc 
gatttaactg 
gctttgaaca 
attggctttt 
gattttatat 
gcttcaggaa 
tttgaggcta 
gtgaggtgtg 
ccagccctgg 
cttggaatta 
agcccttctt 
taaatgttac 
tgcaaagatg 
ggtctcccta 
cacatgcaca 
acgagtttga 
tttgctgtgg 
tatggtactt 
tgttttgact 
aaaaaaaaag 
gagtttttcc 
gttttgtagc 
ttttcagctc 
acttaattta 
tttctgtata 
gtaaagtgtg 
acatccattc 
atactattat 
gttcaatttc 
gtctcacttc 
gtgaataaaa 
gcagagaaca 
tgcctctggc 
ttaggaactc 
ctggatgaca 
cgagttgtcc 
gaagtcccag 
ctacaaaacg 
gggatttagc 
caactctgga 
gaaaggaagg 
gggaaagaaa 
aagaaagaaa 
tttaagactt 
ggcttctaaa 
tagttgtcag 
tcctctgtca 
agtatttcag 
cttttatgat 



140580 
140640 
140700 
140760 
140820 
140880 
140940 
141000 
141060 
141120 
141180 
141240 
141300 
141360 
141420 
141480 
141540 
141600 
141660 
141720 
141780 
141840 
141900 
141960 
142020 
142080 
142140 
142200 
142260 
142320 
142380 
142440 
142500 
142560 
142620 
142680 
142740 
142800 
142860 
142920 
142980 
143040 
143100 
143160 
143220 
143280 
143340 
143400 
143460 
143520 
143580 
143640 
143700 
143760 
143820 
143880 
143940 
144000 
144060 
144120 
144180 
144240 
144300 
144360 
ctccagaggt

tttggttggt 

ttttgttttt 

agaaactggc 

cacgtgccac 

ccagcattac 

ttaagagttt 

ccagaggtcc 

gatctgatgc 

taaataattc 

ggatcataaa 

cactgctgta 

cagaaggttc 

ttatctgaga 

tctcattcaa 

ctcttccatc 

gcagtggaaa 

acactgacat 

cacaaacaat 

ttccaatagg 

gattccaaaa 

caaagacttg 

acatagcagc 

cctctacagg 

aagatattaa 

gtcatcatgg 

agctgactgt 

gcattatatc 

tagtgcacgc 

gagtttacag 

taaacaaatc 

aggtccttaa 

ctggagttaa 

agaagaacaa 

actgaggcat 

gagctgctac 

tttttaaaaa 

aaaaactgct 

taaattttgt 

tcattttaaa 

tcatttttcc 

ggtttgatta 

gactgtagat 

ccctatattg 

gagttccttt 

gtcatctctc 

tcacacctag 

aatctcacta 

actcacagag 

aagcccagca 

ctattgagcc 

gacaggccta 

agggttaaag 

taataagaca 

cactcaggag 

aatgggaggg 

tgtaatttaa 

tcatggtttt 

tccttttgtt 

gaaagactgc 

ctccaggagg

ctggtgcttt 

caagaaaagg 

atagccctgg 



ttgtttttgt 
tggttggttt 
aatttaatgg 
tttgaacttc 
cacatctaac 
tttgactctt 
gatccagggc 
tgagttcaat 
cctcttctgg 
tttaaaaaaa 
ttcaaggtga 
ttcatttcct 
agtattcttt 
tgcagagtcc 
ggcctaacac 
tgggtcagta 
atgagctcct 
ccacccagac 
atgtagttga 
ccagcacttc 
tgtgtccaga 
aaagatggct 
tcacaatagg 
cagaaagcac 
ataatagccc 
gcagtgtaaa 
gagtggcttt 
cgtggatttt 
ctttaatccc 
agagtgtcta 
agatctgtat 
ggacaggaaa 
ggactgttgg 
gaatctatct 
gctatgtgtt 
taaaacctaa 
aagagtgggc 
gctgctgctg 
tgtatacttt 
ttggcactgt 
atttatacag 
cagaaactat 
ccaaaggaca 
atcatctctt 
catcacttta 
tttcagatga 
cctcttactc 
tgtagttctg 
atctgatagc 
aaatttacct 
agtcatggtg 
tgggttcggg 
ttttgagaca 
aatcagacat 
gtgagatcaa 
gaggaggaag 
ggcctaagct 
ctgtcttcgg 
ttcaggcatg 
agcccaagat 
caagacagac 
cctcttacca 
agactctccc 
gggaagagaa 



gggtttatta 
tgctttactt 
actggttcca 
tggtcttcct 
aatatctggg 
catagtttct 
tggagagatg 
tcccagcaac 
tgtgtctgaa 
aagagtttga 
gcctgggtga 
cattgatctt 
ttttcccaat 
cctttattcc 
tgaggacatt 
tgaagtgagt 
ctctgttctt 
ccattctctc 
caagaagtag 
tgtgtccata 
tattacctcc 
tgatggctct 
ctataacgtc 
agacatacat 
acgcttgagc 
agtaaaggtg 
cttctcatcc 
aagttaggaa 
agcactcagg 
ggcagccagg 
ttcaggaaga 
gtattccaca 
aatagttgaa 
gggcatggtg 
tgtggctaac 
aaaacaaaaa 
agaatgtctc 
ctgctgcata 
agctgtgctg 
acagaaatta 
aagcaatgta 
caaagtattc 
agtggagaaa 
ttacttctaa 
gatatataat 
gaaacaaaga 
actccatgaa 
gttgtcctag 
ctctgcctgc 
tcttaatttt 
agacaggctt 
gcagcctgat 
aaactttgtg 
aatattataa 
tccccagcat 
gaagagagaa 
ctcgactgaa 
tacatagtga 
tcttaaagac 
gatggtgcgt 
catcagtgat 
gattctgcaa 
agaacccatc 
cacacagatt 



gtgttttttg tttggttggt tgtttgtttg 
tttcttattc atttttttat ttctttattt 



tgtggccaag 
tcatctacct 
tttctttatt 
tccagagtta 
gctcagtggc 
cacatggtgg 
gacagctaca 
tccatttaca 
actggcagaa 
cagaggtttg 
tctattcagt 
tgtgaccatc 
tcactgtgct 
attgaagaac 
ctgcacacat 
ttctgactca 
ctatgtcatt 
tcagctagat 
ttcgtttgtt 
tccagaggct 
atttccaagg 
gcaggcaaaa 
ttattcctct 
ccccgccctg 
cctaactgct 
atgacaaaga 
aagcagaagt 
gctatataga 
tgcctatgag 
tgtgtgcatt 
tgttcacact 
atgcacaact 
ctgggctaca 
tctaataatc 
tgattttgtt 
gcacacagct 
tcaactttgg 
acagccatat 
ctgtattaaa 
tctaaatgat 
aacttaggaa 
taatagtggc 
ttatctcatc 
cattgaaaaa 
tattcctttt 
aactcaatat 
cgagtgctag 
ctaagacgtt 
aatctttaat 
ctatagagag 
tcaaaaacaa 
acaagtatta 
aatgagggga 
aaaggaaaga 
gttgtccact 
agagtggaaa 
atagtggaag 
aagacagaat 
ctcgaaagca 
aggaatgtcc 
cctgcagctc 
tctggggcct 



gatagcctca 
cccaagtgat 
ggagtttgaa 
ttacactttc 
taagagcacc 
ctcacaatcg 
gtgtaatcat 
ctggacttct 
gtggcagaag 
ctaacgggaa 
ctttagtagt 
tcgttgtttt 
atgccaatcg 
ctggtgtcct 
tgaaatcaac 
tgtcacctct 
gtccacagtg 
gtgctgctga 
tcttctaaat 
gggtttgatt 
ggtctgactt 
cacataaaca 
gatgatacag 
cccaggggca 
gcaacatcat 
ttagatctat 
aggtggatct 
gaaattctat 
ctgtttgaca 
ttacagaaat 
cagtgttctt 
gtattcctaa 
tagtcaatat 
tgggatttta 
cagatggcca 
aatatttgac 
aaaaaaagta 
tagtctagac 
tattcagtct 
gaaaaaagat 
gaattttccc 
catttattga 
ctaaaatgac 
tctaacttgc 
ttttttttcc 
atagaccagg 
ggttaaatgg 
tctcctctta 
ctcagcactt 
agttctatgg 
acaaacaaag 
gtgtcactca 
ggaggggaga 
tctcaaagca 
tttaatgacc 
ccaagggtca 
caggtatgca 
gcctggtgaa 
gcctcactgc 
agcatctgaa 
tcaccaatgc 
ccctctctgt 



actttgtagc 
gggattaagg 
agggattcct 
atttgttaca 
aactgctctt 
tctgtaatgg 
ataaataaaa 
tgaggcagca 
ctgtgtctca 
gtaagtggaa 
agatccctca 
tcagggagtg 
ctctgcagcg 
gcctgtggct 
tagcttgcaa 
cataggtgac 
catggatttg 
tagtatttta 
aagccaggca 
cccagcccca 
cctgttctgg 
taattgaaag 
ctctcctgaa 
tcagtgaggt 
caactgtgga 
ggccaggcac 
ctgtcagttt 
ctggaaagaa 
tgtgtgatag 
tggttataca 
caaatcaaat 
catgtagaag 
tggacagtca 
tattttcctt 
cagaacctag 
tatacgtata 
tcccagttta 
acattaaact 
tatctacagg 
tcaagaatct 
tttatcccct 
acatacccag 
ctgttgatga 
ctgcataaga 
tttgagacag 
ctagcctcaa 
atgtgtcacc 
aaaaatggaa 
aggaggcaaa 
gttaaagttt 
ccagactgct 
attaaaaagt 
gaaggaaatg 
gaacacagga 
cttttcatgc 
tcctggaatt 
tgggtatcct 
cctggggaga 
caagccaacc 
aataaagatc 
cttcgaggat 
cagcttatgc 



144420 
144480 

144540 
144600 
144660 
144720 
144780 
144840 
144900 
144960 
145020 
145080 
145140 
145200 
145260 
145320 
145380 
145440 
145500 
145560 
145620 
145680 
145740 
145800 
145860 
145920 
145980 
146040 
146100 
146160 
146220 
146280 
146340 
146400 
146460 
146520 
146580 
146640 
146700 
146760 
146820 
146880 
146940 
147000 
147060 
147120 
147180 
147240 
147300 
147360 
147420 
147480 
147540 
147600 
147660 
147720 
147780 
147840 
147900 
147960 
148020 
148080 
148140 
148200 
acacaccctt
ctggtgaagg 
cattgccttg 
tgtctgaagc 
agccaggaat 
gggctgctcc 
agatttgaag 
acctgtttac 
agtgaagaga 
tatggaagta 
gaacacagat 
agcgtgcagg 
gcagatttga 
gcccaggcaa 
attggtattt 
aataaatcag 
agctttgagc 
tctaagacag 
tctcttaaaa 
caagaggcaa 
caggctacat 
caatagcaac 
gcttttttca 
catgtatgtg 
gccactgagc 
atttattcat 
ctaccatgtg 
tcctgaacca 
tattatatgt 
tagagatgat 
gcagtcagtg 
tggtttccat 
gaacttggtg 
aatgatggtg 
catactacac 
tggagagtga 
aagaaaagtc 
tttctccccc 
gccggaggat 
aggacgacat 
atgttttgcc 
gggcatcaga 
ggaatcaaac 
tttattacat 
cccccccccc 
tctagttaat 
cagattaata 
tacactgtcc 
tgccaattta 
agcagtagga 
agttagtctc 
cctctcaggg 
gtgagtggaa 
atgggtgaga 
ctctgacaat 
ttattggcag 
ggacagaggt 
cccaggaatc 
tttcaccaag 
gagaaggaga 
tcaagacact 
tctgccttca 
tggctcggct 
ggtttggttt 



tgaaaggccg 
aagatgcttt 
agaacaagtg 
acactttggc 
gtatgcagca 
tcacactcgg 
atgttgatag 
ctgaaagcaa 
agagcaagat 
attataatgc 
tcatctgtgc 
cccaaacctt 
ctgtttgggg 
taacatccat 
caagagctaa 
tgacctaaaa 
acatttctgt 
ctttagctat 
acttgagcca 
aggcaggcag 
agtaagatcc 
aaattaattt 
gacacaccag 
gttgctggga 
catctctcca 
tttgtgtgtg 
tgtcttggag 
tctccctagc 
aagtacactg 
tgtgagtcac 
ctcttaactg 
tttatccaaa 
tgtgagcatg 
gcctgcctga 
actagctgag 
tcatctccac 
cttttaaggt 
tccctcattg 
ctttgtgaac 
agatataccc 
tgcatgtatg 
tctctcagaa 
caaggtcttc 
attttcctca 
ccccgagcag 
taagctgaaa 
ctctgttagc 
agattttgcc 
tacagtgttt 
tttgaaagct 
tgtggtacac 
tgtccagaag 
ttctgcagca 
gggccttctc 
cttagcctgt 
taaatcagag 
cattggctga 
tcaggtgctt 
gttatgtatg 
agccctgctg 
gatagacttt 
ttggctcttt 
agtaaaggtg
ttgttttttt 



ccatctagtt 
tgtaagtgtc 
ggataccaga 
cacagtacct 
ggcatgggac 
ggtcttctgc 
agttgtaagg 
gaatctggtc 
aggtgaaccc 
tatcttcagc 
catcagtgtc 
aaagtcccct 
agcagccatg 
atgtcacaca 
tgtttaagga 
ggaaaacaca 
gaacccagag 
acagcacgtt 
gagccaggtg 
atccctgaat 
tgtttcaaat 
tttagatgta 
aagagggcat 
attgaactca 
gtccattaat 
gtatacatac 
atcaaactca 
catttctagt 
tagctgcctt 
catgtggttg 
ctgagctatc 
gatgccttta 
aagaccagag 
gattccagga 
ctgtgtgctc 
aggcaagcac 
ggtggtggtg 
tgatggcaca 
tggaggtcag 
tgtctcaaac 
tctatgcaca 
ctggaatttc 
tggaagagca 
attacatttc 
caagtattct 
gggagggagg 
ctaactacac 
atcctttttg 
ataggtattc 
gacatttatg 
attttggggt 
ctgctgcaca 
tcccggtcca 
caagggcctt 
gcatagtttt 
gttatccact 
aggataacgt 
gctgactggg 
gcagagggta 
ataagggaag 
gtccttaatt 
gagccactgc 
tttgcctcta 
gagacaaggt 



gccacaaaag 
cttatcccag 
gttaccagtg 
tgtggcagct 
ctctaccata 
catgttgccc 
atgctttgtg 
aaggcatttg 
ccaattcctg 
cttttgcccc 
tccgcactgt 
aagctgaaag 
ctgcgacaca 
ggtaagtcag 
gaaaaacact 
gccgtcttat 
cttgggaggt 
tgaggtcagc 
gtggtggcac 
ccagcctcgt 
aaataaatat 
tttatttatt 
tggatcccat 
acacctctgg 
taaaaattta 
tataatacag 
ggttcttagg 
attcttttct 
caaataccgg 
ctgggatttg 
tccagcccca 
agggcctggg 
ttcagatccc 
caacggagac 
aagagagcag 
acacctgagc 
ttgttgtttg 
ttcctttaat 
cctgttctac 
agacaaaaat 
ttatgtctgg 
agacacttat 
gcaagtagtc 
caatgctatc 
tcattgctgg 
tagatgttgc 
tatagaagct 
ngtgtatatg 
tttgtgacgt 
tggcctatgg 
ctatctgcgg 
ctgggctgga 
gctgggagtg 
cttagtgtta 
ttattcgaga 
cttagggaag 
actgagatgc 
tttcttaggg 
taggggtttc 
tcccccattt 
agcagggccc 
tacaaaatat 
agcctgaagg 
ttctcagcat 



acattctccc 
gagaaatgcc 
gagacctcta 
gcagctatgc 
gcacagagtg 
tgagaatgac 
acgaggttgg 
attacacaag 
gatgcaatgt 
atactgaaaa 
gcagacaact 
cagtgacccc 
tgctacagct 
aaatggtttt 
ataaaggaag 
atatcattat 
ggagatagga 
ctgaactaca 
atgcatttaa 
ctatatagtg 
aacaaaaaca 
ttatgtatga 
tacagatggt 
aagagcagtc 
aaactagagt 
gttcataagt 
catgggagca 
tttgtcttga
aaaggggaat
aactcaggcc 
tttttagtat 
aagatggctc 
tagcacccag 
agacaggggc 
ccctggctta 
acacagacat 
ggggtctttt 
ctcacatctg 
atagcaagcc 
tatttatttt 
tgcccatgaa 
caagtactgc 
tttttttttt 
ccaaaagtcc 
gccatctccc 
ccaaacttag 
tattctttag 
tctacagatc 
ggatctttta 
tcctgttaaa 
ttccgcatct 
aggatgaagt 
aatgctgggg 
cagctctagg 
tgagcttgta 
gagataggaa 
tcattagcac 
ggttgagagg 
aggctcccca 
tgagccactt 
aacaagtgtc 
ccaaatctgg 
cctgagtttt 
agccctgggt 



aggagaactg 
acgacctcat 
ctgtcaccga 
caagtattgc 
ttctcttggg 
tcttttagcc 
tagcacagac 
tcagggagag 
caatggaaag 
gcatagccca 
caaagctgac 
agggctgtgt 
gcagtgtaat 
tacttacatt 
cctggcatca 
gctattgaga 
tgattaggag 
tgagaacttg 
ttctagtact 
agatccccac 
gcaataataa 
gtacaccatt 
tgtgagccac 
ggtgctctta 
atttttaaac 
caattctctt 
agtatttact 
aagatttatt 
caggtcttgt 
ctccagaaga 
tcttattaag 
agtgggcatt 
gcagatgctg 
cctagctaac 
ctgtgcagga 
gcacaaagga 
gttttgtttt 
ggacaaagag 
catttcagcc 
atatatttga 
agccagaaga 
ctgagtgcta 
ttaatatttt 
cccataccct 
catctccttt 
gatttattga 
actttcacat 
ttaattcagc 
cccatcttaa 
tcacatttca 
cacacttttc 
ggagtccaga 
tcaggaggag 
caaaggcctt 
tgcatacact 
tacccaaggt 
ggggaggcat 
gtagcagtga 
gacaaagaag 
cagaaggcta 
tgttttcttt 
gccaggaagc 
cacttgattt 
attctggaac 



148260 
148320 
148380 
148440 
148500 
148560 
148620 
148680 
148740 
148800 
148860 
148920 
148980 
149040 
149100 
149160 
149220 
149280 
149340 
149400 
149460 
149520 
149580 
149640 
149700 
149760 
149820 
149880 
149940 
150000 
150060 
150120 
150180 
150240 
150300 
150360 
150420 
150480 
150540 
150600 
150660 
150720 
150780 
150840 
150900 
150960 
151020 
151080 
151140 
151200 
151260 
151320 
151380 
151440 
151500 
151560 
151620 
151680 
151740 
151800 
151860 
151920 
151980 
152040 
tcactctgta
gcttagatta 
gatggctcag 
gcaaccacat 
ctgaagacag 
aagaacacta 
ttcattctag 
gaaaggccac 
ttctacctgt 
gatggttgtt 
gccaccacat 
agttacccat 
gcacatattt 
agcccagttt 
aaaaacaaac 
ttgggcgtgt 
ttgggaggca 
gaaaccctgt 
gttaatatga 
ggagacattc 
cnnnnnnnnn 
nnnnnnnnnn 
aaccgtcagg 
attaaagcac 
gagaggcaca 
ctgggagaag 
agcagctatt 
tctgcagggg 
ttgtgtctat 
gtgtgtgtgt 
gtcagtgctc 
ggccaggctc 
taggaaataa 
aattggattc 
tatgcagagc 
gactgggctt 
tgcccgtcct 
gctactgaaa 
cagggccctc 
ctcctgaggc 
ccatccaacc 
agaaacctct 
gctaatgtct 
tgtgttgcat 
tgattcttgc 
gcccagggac 
tttttttctt 
tagtttataa 
tttatatatt 
taagtggaca 
aaatgtcatc 
ccgggggctg 
gtggcatagg 
tctgagattg 
tccaaaacca 
agcttcttca 
cactgtgctg 
atacccttcc 
cggtggctga 
aaaaaaagaa 
agtgctgggt 
ggccaggagt 
acggatgggg 
gtcaaagctg 



gatcaggctc 
aagttgtgcg 
tgggtaagag 
ggtggctcac 
ctacgggtgt 
ttcacaaggt 
gggaaaaatc 
agcacgtgta 
ttttgaggca 
ctatttctgc 
aagcatggca 
caagccatct 
aatcccagca 
acaaatcaag 
atccttctgt 
caaaaatcac 
gaggcaggag 
ctcaaaaaaa 
aatagcagaa 
catactgaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ctcagaggca 
caaatcttgt 
catagcttcc 
cttgaagctc 
cagcagctgc 
aaagacaagg 
gtgggtgagg 
gtgtgacaga 
cctttggctc 
gcccacccca 
aaaacagggt 
agcatttctt 
acaagagact 
gcctgcaggc 
ggggtcatcc 
caccctcatc 
agtaggtcac 
cagagctcag 
cattggacta 
tccccagctc 
gcagcttcta 
ataaaaacga 
tgcttctctt 
tattgttgct 
gctagaagtt 
acatgataag 
tatatatata 
taggtataaa 
cactggcgat 
cctgagacgt 
gttggacagg 
ctggttgccc 
gcagggtcct 
aggggcgcat 
gagtggatac 
cgtctacggc 
aggcttcagc 
agaaagaaag 
gacggagccc 
ggggccccaa 
gtcctgcgag 
aacttgtgct 



aacttgaatt 
ccaacactgc 
cacccgactg 
aaccacctgt 
acttacatat 
acaacaacag 
tttttaaact 
tccaggttag 
gtctcttgtt 
ctctcatgtt 
ccagcactga 
tgctagcccc 
ctcaggaggc 
tcccagaata 
gttccagtca 
aagtagcaag 
gatctctgtg 
ccaagcaaaa 
ctcaagatag 
tatatgcaga 
nnnnnnnnnn 
nnnnnnnnnn 
ggatggcagt 
actaaaaaca 
gggcccttgc 
aagaactggc 
ggctcttggg 
agatggcact 
gaagtgccgt 
gacagaaagg 
ctcctgccaa 
ccaaagccca 
tcttgtccaa 
actcctcagt 
gtgcttgcca 
tgccttctgg 
aggaggagca 
tgctaaggcc 
tggccaaaga 
agcctccctg 
acactgtaga 
cgcctagctt 
cagtaggggt 
ggtgatgttc 
ccttcccgac 
ggctggggtt 
accgatacag 
acatacagtt 
tatatattat 
actgcacctt 
ctctcctttg 
caggaatgag 
gcaggttaac 
cagctggctc 
tcagcccttc 
gtcccgacac 
ggtccagagg 
tgtacatttt 
agccttttcc 
aaaagaagaa 
tgggtcctca 
gggccgtgcc 
cacacttgcc 
gatagtcggg 



cagagatctg 
ccacctaaaa 
ctcttccgaa 
gatgagatct 
aataataaat 
tgtactagga 
ttagtgtgta 
gagtcaactt 
tctgccctac 
gccctatgca 
gccaagggca 
agaaaatgta 
agaggcaggc 
gccaaggcta 
caatgactgc 
gtgtaatggg 
agtttagcac 
atagaattac 
ctaatgaaac 
gcctgatgct 
nnnnnnnnnn 
nnnnnnnnnn 
gatgtgtttt 
gccctctgct 
cctctgctct 
cgctgttgcc 
caagcttgga 
gagctgggta 
gtgtgtgtgt 
gagagtgcat 
aaagcatgct 
ggatctgcca 
gagaattttt 
atcctctctg 
gaaacccttg 
gctgacccct 
ggaccactgt 
actagatttt 
gcctagatgt 
gcctgcacaa 
cgctgccttg 
gcctccagca 
gacggggtgc 
taagtatcta 
tcttcccact 
ctgaacaaaa 
tccttaattg 
tggtaaggag 
gtgggtgtgt 
ctgtgtgact 
gtgattggtt 
aggcattacc 
taagtgatct 
tgggagagcc 
tctccaaatg 
cactaatgac 
cgctcggtca 
gggcttataa 
tgaaacccag 
aaaaattctg 
gcggagagtg 
tgaggatgcc 
cttcctcttg 
ggcatagtcc 



cctgcttcta 
aaaatatgag 
ggtccgaagt 
gatgccctct 
aaatcttaaa 
aactttaaaa 
agaaagagag 
ttcagaagcg 
actttgtac^ 
tgctgagctt 
ggcatctggc 
tttttgacag 
agatctctgt 
catagagaaa 
tgtaacaata 
caggccttta 
agccagggct 
aagttaacca 
aaaggattct 
tgcctagcaa 
nnnnnnnnnn 
ngcctgggca 
ggggcacaag 
ctcgatctgc 
cccggtcagt 
tttgtgctga 
agtagctcgt 
gccaggatgt
gtgtgtgtgt
gttcttgtgt 
gtgttctcag 
ccccattaag 
attattattt 
gtagggaatt 
ggccaacagc 
gagtctggcc 
gatacagggt 
tttcccactg 
taaaactgca 
agtgctaaga 
tctgccagct 
ccctgacatc 
tccggggtgc 
agaatgttgg 
gacacctttg 
ttgccgcagt 
agcaaatata 
tgggttgggg 
gggcagagtg 
ctcattgcga 
cttggacccc 
cccatggata 
cagacaagag 
ttgttctgag 
accaggcttc 
tcactttgcg 
ggacagccga 
tccaccagga 
cagatcttcc 
ttccttgctg 
actgccagcc 
tgctgcaccc 
ggtcgtgcgg 
tgcagttcgg 



catcccgagt 
gggctggtga 
tcaaatccca 
tctggtgcat 
aaaaaaaaaa 
tagcccatca 
aggggctgta 
gagtctcccc 
tgaacttcaa 
actgatgcca 
tcacatggag 
gtgtggtggt 
gtctgaagcc 
ccctgttttg 
atatgaggat 
gtctcagcac 
gttacacaga 
gggtattggt 
attttataaa 
ctgaactaca 
nnnnnnnnnn 
agagtaagca 
gccccctttc 
agagatacga 
tcctggactc 
cgccagaacc 
tccttctcct 
gggaagtaaa 
gtgtgtgtgt 
gttcctgtga 
agtcaagctt 
aatgcggggt 
tttctctctt 
caggtctgtc 
cctattccct 
ctctgacctc 
tcctgagatg 
ccagactact 
gctaaagcct 
aggacattgt 
tgaaaacccc 
cacttctctc 
cagacaggcg 
tcccctgaag 
cccccagcaa 
tttttgtttt 
ggttcccgta 
gacctgtgca 
aggatatata 
gtacagttct 
agcaggtctg 
gggactgagg 
ccaagagtgc 
tcctgctcct 
cgcagagccc 
tgcctttgac 
gtgagacgtg 
agggtccaga 
acttaggaaa 
gacatgtggc 
ccagtatcca 
cactgggggc 
tgggcatgat 
taagctcttt 



152100 
152160 
152220 
152280 
152340 
152400 
152460 
152520 
152580 
152640 
152700 
152760 
152820 
152880 
152940 
153000 
153060 
153120 
153180 
153240 
153300 
153360 
153420 
153480 
153540 
153600 
153660 
153720 
153780 
153840 
153900 
153960 
154020 
154080 
154140 
154200 
154260 
154320 
154380 
154440 
154500 
154560 
154620 
154680 
154740 
154800 
154860 
154920 
154980 
155040 
155100 
155160 
155220 
155280 
155340 
155400 
155460 
155520 
155580 
155640 
155700 
155760 
155820 
155880 
cccagagctc

cttcttgtaa 

ggccccatgg 

agacttgatc 

cagcttcaga 

gccacggaac 

attcccattg 

ctcagtgagg 

cttgtggtga 

gccccggaag 

gagattgacc 

gtagagatac 

acacttatag 

cgttcgcagc 

ggtgttggga 

gctgaagtgg 

gtcctctggg 

gggcgcaggg 

gagcagcaac 

ggggctgccc 

aatgactctg 

cagggcattt 

tctctaattc 

ttgtgaacac 

tagctgcctt 

tatttaaata 

aataagcgcc 

gtttcctcat 

tgaggggctg 

tgccagatcg 

attgccaggc 

tgtagcttcc 

attttcggat 

ctcattatgg 

ccagcaggcg 

ggccaggcag 

gttcagctgg 

ttcttgctgc 

cccacttggg 

gacttctgac 

gctgttggga 

ggcttcagct 

aattgggttc 

taggacccct 

cggcccttag 

tgtctagatg 

ccagcctctg 

ctgccctgtg 

tccctgcttg 

cggtcccctg 

tttatgctgg 

cttttcttcc 

accattcgac 

ttcccctgtg 

ggaagtccct

gctctattta 

agtttataaa 

actggggttt 

tgcagaacca 

caatggacca 

ctggaagcat 

tccacatgcc 

tgaggggctc 

ctgagcaagc 



accttagaga 
cctgacctgg 
gacggatggt 
tggtgaggag 
tcctggcctt 
cttcgcagcc 
aggcgaagga 
ctgttgttga 
acccactgta 
atgccttggc 
aggtccacaa 
tgcaggctgt 
aggtagaggg 
tgtcggttgt 
gcaatgaaag 
ccctgctgga 
atgccctccg 
tagcacacac 
agcagcaatt 
agaggaggtg 
ttggctcaca 
ggggccaagg 
catccgaaca 
aggcgagtcg 
agcaaggaat 
attcctcaaa 
tctttcacac 
gagttaaaca 
tgagtgacac 
aggctgcatg 
tgtggctgta 
ctctaattgg 
gtgtccagcc 
acgattgtca 
cgtcaggaag 
ccacgaccta 
gactcttggg 
tagctgtgct 
gtcccaggcc 
gccccccccc 
gttatgggct 
agaatgtttc 
agttgtgatt 
ttctgggccg 
acagcatgca 
ttactctgtc 
tcctagggat 
ccaccagcac 
gcttgcaccc 
ggaaagcctg 
atgagacaca 
ccggcctgca 
atttgtacag 
ttctcccacc 
tggggagatt 
ggaaaaaaaa 
cataccgtgg 
tccctctaaa 
ggctcagttc 
caaggacctt 
actaatagga 
agggatgtta 
agccactctg 
tgtcctagtc 



tctggttccg 
agccaggcgg 
gctccttgcg 
acactggtcc 
gccgcagctc 
attcccacag 
actccaaggc 
agagaaagag 
gctggttctc 
ccaggctcca 
agatgtcatc 
gcaggccacc 
cgtgaagctt 
ctccaaggtc 
tgatgttgtt 
ggaaggtgat 
ggatggcagc 
agtctcgagg 
ccacacagca 
gagatagatg 
gaggttctgt 
cccacattat 
cttcttcaga 
gcctccggct 
gactctaaca 
ataaagaaag 
tgtggtacaa 
cagttcacag 
cagggctgac 
ctcctctccc 
aaaccgggcc 
gcattaatta 
tgtggtcggg 
ttacctgcct 
atggggacag 
tgtcgcctca 
agagccaggg 
cagagcaagt 
cagggactgc 
cattcccaac 
cccagagagt 
tagcacataa 
tcagagcaat 
ggccatgccc 
ggctataacc 
gatcttcctc 
cctgtgctgc 
tcacactggc 
tctcttccag 
gactctaggt 
gcagagcctt 
gctgcaggca 
ctcatcggag 
tactgtggct 
acccccctta 
aatcaaactg 
caagtggagg 
agggcaatgg 
ccctgttatc 
gtccttgtag 
tgtaggctca 
accttccaag 
catccatgtc 
agccctggcc 



gttcctgtgg 
atggccatgt 
ggcagccctg 
tgtgcagttc 
gggggtcgcg 
ggaacgtgcc 
caccaggggg 
ggtggttagc 
atgcagcagc 
tagcttgtta 
ttggaggtac 
aaagatgcct 
caccaggcct 
tagctcctcc 
ggagtagatc 
gcgattgttc 
aaagttgtgt 
acaaccacca 
ccctggtggg 
gggaacagag 
cctctgtatt 
ctcctcacct 
ctgtcagccc 
ctgggtccgg 
aagcaaatct 
cattgagtcc 
aaacttcaaa 
ggctgtgtgt 
gcacgaggct 
ccattcacac 
ttgcctcttg 
ggcattcgtt 
taataggcct 
ttttccaggg 
gctccaggcc 
gttggcctct 
cccctggggg 
ggaaggagca 
ccaggccccg 
agaaacagca 
ggcagctgct 
aaatcatcgc 
tatgacctca 
tgcaggccct 
atctctgcct 
ctcagcatcc 
agagggaggc 
tgcacagtcc 
gaaggccatt 
tcaaggacat 
cttcactggg 
ggtaggggtt 
tttacagagg 
gctctgtctc 
cataggggcc 
aaccaaacct 
aggcaggcgg 
ggggacaaag 
tcaaggctat 
gacagtcact 
ggacagctgg 
cctccatctc 
tttgaaagcc 
acttctggac 



ctggtgcagt 
gggtggcctt 
tcagaggtgg 
cggaagtcct 
caggggacag 
cggcagccac 
gccagacagt 
ctgtggaggt 
aaccggtcca 
ccatggagaa 
tcgatatggt 
gcgggcaggg 
tggaaggtct 
agatgcacaa 
cagagggtga 
tgcaggaaga 
gcctggcagc 
cccagaggta 
gagagacaga 
ggtggaatgg 
gtatgcaggg 
ctttagctct 
caccgcaggg 
ctctgccact 
ggagtcctga 
atggtaccaa 
cctacaactc 
acagacaagg 
tcccttgggg 
ccccccctcc 
actgtccaga 
aatggatcct 
atgctcatta 
tcacagctgc 
tacgggcgcc 
tgccccttct 
aatatgagct 
gggaccttct 
gcagagtagg 
tcgtaagttg 
tctcgtccct 
atataattta 
gcagcagggg 
ctgcagtgct 
caatttcctg 
tgggtgtggc 
acagtcggag 
acagacccac 
cttgccagaa 
tcatgatgct 
gggtcctgtg 
gcagtgggct 
gcttttgttg 
tgtgcacccc 
tccaaggaat 
caggtgtgga 
cagaacggca 
ggactctagc 
catcactagg 
tcctgcagtg 
tctctgtcct 
ttctaatggg 
agtggtatta 
tccttgcctg 



tcttgcctgc 
tgtccctgga 
taagcgtgtg 
ccaccctcag 
cagagctaga 
agtcccaagc 
caccctgcag 
catggaaagc 
ggttcaccag 
acaagtgact 
tgtcctgcaa 
cgctcagtcc 
cgggtgccag 
agccctcgaa 
ccatggcggg 
tgcgctcact 
tgacagtcat 
gctctccagc 
acagcagtga 
gggactggca 
gtcccttgga 
gtccctaaag 
tggaacacac 
cgctcactgt 
atgatcactt 
agcatgcctc 
ctccatggct 
cacatttctg 
ttcacagtac 
tgtgctggag 
gcatttcctc 
taaaataatt 
tggaagccgc 
cccaagtggc 
caccctgaag 
tttccagctt 
gagctgaatc 
gaccaggctt 
ttgccacctt 
acagctccca 
gtaatcaccc 
gtttttgcat 
tggcacagca 
ttctgcccac 
ctccaaaacg 
ctccagcctg 
ggaggggagc 
agctccaacc 
cctttcccaa 
tgccccacat 
aaaatgaaag 
tatcactaat 
taccctcaac 
aaaagaatct 
acaggctaca 
cttagtaacc 
tgaaggtagg 
caagacctga 
aggcctaagg 
aagtgctctt 
ttagtatttt 
gggggggtgt 
ctccaggacc 
agtcagtagg 



155940 
156000 
156060 
156120 
156180 
156240 
156300 
156360 
156420 
156480 
156540 
156600 
156660 
156720 
156780 
156840 
156900 
156960 
157020 
157080 
157140 
157200 
157260 
157320 
157380 
157440 
157500 
157560 
157620 
157680 
157740 
157800 
157860 
157920 
157980 
158040 
158100 
158160 
158220 
158280 
158340 
158400 
158460 
158520 
158580 
158640 
158700 
158760 
158820 
158880 
158940 
159000 
159060 
159120 
159180 
159240 
159300 
159360 
159420 
159480 
159540 
159600 
159660 
159720 
tgccaatcct
atggcgcctt 
attaatgtac 
aggaatccaa 
cttgggcagg 
cttcccctcc 
tcccacgcac 
acttgccccc 
atcatgcccc 
gaaattaaga 
cttggtcact 
tttgaactag 
gctcagacct 
tgggtcagct 
catggtccca 
atgtgcaaag 
agcctccaag 
accagaacca 
aggggccctg 
gtctttaatg 
agcatcagtg 
ccctgccttc 
tcgaaatggg 
agcacccaag 
ggtcagtggg 
ggccttcagt 
tgttaaccga 
ctggggctgc 
ctccttcctg 
ctcagcctcc 
cagggctgtg 
agcctcgttt 
cccacagata 
ccaaaccaag 
taaagatggt 
ctctggggaa 
actcgctatt 
gccctgcctc 
agtcaggaag 
ccccaggtct 
ccgtccctcc 
tacctggacc 
agcgatttgc 
acagcacgac 
tgctccctgc 
gtctctggag 
tcatagtagg 
ggagaacaga 
ccgtttgttg 
aaatagactg 
atgctaaaga 
cagaaatgtc 
aggtgcttgg 
aagtgagttc 
taaaaatctt 
acacacacac 
gggctgaact 
tgtctgtgta 
tggatggaga 
actcacatct 
cccaggccca 
gcccagcatc 
agcaccccag 
cccaggccca 



aggattgtca 
ctatgccccc 
caacaatgag 
gtctctgtgg 
gctgccttct 
aggaggtgcc 
gtgagctccc 
tagtctagga 
atttgcatat 
gaggggccta 
gatcccctct 
ggctgagaag 
cagcacacac 
ggcaagcccc 
tgggaccaga 
gcactggggt 
gaatactgta 
gcatcaacag 
tagcaccctg 
aaaagccctt 
ggctggttgc 
aggctggggc 
ctgggtcagc 
ggggctgccc 
tctctctgaa 
cctccctccc 
atgctctctt 
tggcgcctca 
gctcggccct 
actgctgtgc 
gagccagagg 
tcctcacctg 
ctggtaccca 
ctcttctctt 
ccctttacat 
attctgctgg 
gctaccattc 
ctcaccccta 
ggctgctcct 
ttctcccagc 
cacatgccag 
ggttcccagt 
aaggagcttc 
ataagtcact 
acacagtagg 
aattgaatga 
tgctcgataa 
cacctgaact 
gatgagcact 
ggactgggga 
aaaaaaaaag 
tgggtttatg 
gaggcaaaga 
taggacagcc 
agtgggggag 
acgttagtat 
cagagtgtta 
gatgtgtacc 
catggttgtg 
acctctggag 
gcaccccgac 
ccacgcccag 
acccagcacc 
gcaccccagg 



ccagcaggtt 
tctatgaggc 
gatggagcct 
aggggccctg 
acttgagggg 
cttcatcgtg 
aaggggctct 
gagccatgga 
gcggtggcaa 
aggaaagcct 
gtaacaccag 
ggtctgctct 
tcaacagaag 
aaactcttcc 
cctggtagga 
gggaaagggc 
acctcagctc 
ggcctgtctg 
gccaccagaa 
ccggaagcca 
agctggagaa 
tctagccagc 
tttttcaggc 
ctctacccct 
gctttgtagg 
tgcccccagc 
gctgtgggga 
ggccacttgg 
gaagacaaga 
agagcggccg 
cctaagttga 
caaaatgggg 
cctgcccttc 
cctctccggc 
cagctgcctg 
ctagatttta 
acccagtagc 
cacctccttc 
tagtctgcat 
ccagaggccc 
ccctcagcct 
gtgtctgcag 
tttcctggga 
tgtttccttt 
tgctcattaa 
tgacactcag 
atgttggttg 
tggctcacat 
ctgagaagga 
gtttcccaag 
ttagtcactc 
aagagaggaa 
cagatggagc 
aaggctacat 
cagtcaagga 
aatatcatac 
gtgtgtgcta 
tgaggtgttt 
tttgctaggc 
actgagagtg 
ccagcatccc 
catcccaggc 
ccaggcccag 
tccaagcact 



tcttcctagg 
ccttgggagc 
ttgccatgca 
cacctcttct 
gcggaaggga 
ttctgcttcg 
acagcctccg 
agacagtgtg 
taccctggtg 
ctgctatccc 
cccttctgca 
gttcagctgc 
gcatgcaagc 
tgccaagctg 
taggtggcaa 
ctgtgcttct 
agctgggctc 
ctaaacccag 
ctaacgagga 
catttgcaca 
cacagcaggg 
tccctgatgc 
tccaaagggt 
aagtagaggc 
ctgcttcttg 
ctggccagct 
tggcggcctc 
cacttgtgcc 
aagctggcca 
gcagcctcct 
atcctggctg 
cagtcattgg 
ccacatctcc 
ctcctccgtt 
ggaggaacac 
gaatgtatcc 
ctcctggatg 
aacaggaact 
cctccagagt 
tgcctctccc 
ccctcctgcc 
ggctgcttcc 
cattttactc 
atgtccgctg 
cagttctggg 
gaagccctag 
taattagtcc 
caagactcct 
cccaatacca 
gtatgggatt 
taaaaggggc 
gccagatatg 
tctgtgagtt 
agagaaaccc 
ggaaacacac 
tatggctctg 
gtggatattt 
atgtgtacac 
atttgcatgt 
acaacccagg 
aggcccagca 
ccagcacccc 
catcccaggc 
caatgcccag 



gaggcaagca 
cccgccccac 
ttttaacatt 
gccagactca 
gaagacccag 
ttactctcaa 
tcattccttc 
ggaagggctt 
ggtaccagga 
tgggctacca 
acctgccaga 
cttggctggg 
aagggagcta 
agcatgaaaa 
ggctaaggca 
caccccctct 
gggtggccag 
acctcacaag 
agatctgacg 
gaagaaaatg 
ggacaggtcc 
ctggagtagg 
caggacagct 
atccccatgg 
gccatgtagc 
gctcttctct 
aggccaggcc 
acttgtgttc 
gggaacagct 
atccatggct 
ctccttttaa 
aagctcagcc 
atctgtctaa 
cacaacttct 
ctcagaatca 
agtatctatc 
ccctcccccc 
agggtagtcc 
gcccatctaa 
tggctacaca 
tttgtccatg 
cagagaagcc 
tacagcacgc 
ccactttgtt
ggagggaatt 
gctcaagcct 
tgggagactc 
taggcccatc 
gtcctttcct 
ttcagcacta 
caggaccaaa 
gtggggcaca 
tgaggccagc 
acttgacttg 
acacacacac 
tgcctgcagt 
gagctctgta 
aggtgttggc 
gtctgagttc 
gcccttttat 
ctccagaccc 
agacccagca 
ccagcacccc 
caccccgacc 



ctgtatcacc 
tgattgcctg 
gcaaattagc 
tcaagcgcct 
ttccactctc 
gcctccggcc 
ttccattcat 
gacaatgagc 
gagtataggg 
gtcagcattg 
gtttttgacc 
aggggaatct 
gcagtggcct 
gccacctcac 
gcggaatagc 
aatggtgcag 
agagcttggc 
ccagtttagt 
ctgggaatat 
aggtgcccag 
taccaagcta 
taaagcagcc 
gctgcagctt 
cccctgggca 
caatctctct 
gagcaatcga 
agctgcactc 
taaacacagg 
gggctcccat 
gtgagtagaa 
tgcttgcagg 
agtcctccag 
ctgaaaccat 
ccatcttggg 
ccgtggctca 
cacttatcat 
ccccccgcaa 
agggaaagtg 
ctagaagctg 
gaccttgctg 
ctgttccatc 
actcttgagc 
gacctcttgg 
ctgatgagtg 
accttctcaa 
ggggcctgga 
agagccttca 
aaggaagtga 
gggcaagggg 
aactggaata 
acctttcaaa 
tctttaatcc 
ctattctaca 
ccacccaaat 
acacacacac 
ccaggaatga 
tttatgtgca 
ctgttgcata 
atgcacataa 
cctgcagcac 
agcaccccag 
tcccaggccc 
gacccagcat 
cagcatccca 



159780 
159840 
159900 
159960 
160020 
160080 
160140 
160200 
160260 
160320 
160380 
160440 
160500 
160560 
160620 
160680 
160740 
160800 
160860 
160920 
160980 
161040 
161100 
161160 
161220 
161280 
161340 
161400 
161460 
161520 
161580 
161640 
161700 
161760 
161820 
161880 
161940 
162000 
162060 
162120 
162180 
162240 
162300 
162360 
162420 
162480 
162540 
162600 
162660 
162720 
162780 
162840 
162900 
162960 
163020 
163080 
163140 
163200 
163260 
163320 
163380 
163440 
163500 
163560 
ggcccagcac cccaggccca gcatccaagg cccagcatcc 
cagcatccca ggcccagcat cccagggaca gcaccccagg 
atcccaggga cagcacccca ggcccagtat 
cagggacagc accccaggcc cagtatccca 
tggaggcagc acatactgaa gatagggaat 
cctcattgct ttcagcacct gctctcctca 
tcccagtacc aggattcccc tcagctgagg 
gtctggggtt ggcagcccca tgctaattgg 
cagcacatcc tgcccattag gcctaacctt 
tggagcctac cctgtatagc tctgtattga 
tgggagtcat ttcacctagg caactccaga 
gtatgggaat aggtaaggag agcatagtga 
ctgcacggga gattgtgtca ctgtggttcc 
aactcaggat ggtccagaat ccagagctct 
ccagggctaa gacccttcct tggacggttt 
tctgtcatcc ccatctgtgc ctaacagctc 
ctacttctcc cagcatccca tgcccgcacc 
aatcttctgc taggtttgag ggccccaggt 
tcccttttac caccaccact aatcctcttc 



tttctncncn nnrmnnnnnn nnnnnnnnnn 



nnnnnnnnnn 
aaattgaact 
ccttgtgatc 
tagtggttct 
aggcattgct 
tcttaagatg 
agcatgtgct 
acaaaaccaa 
attgggacac 
gaggatgaaa 
aaaatatata 
ttgactgtat 
cccaaaattc 
caggaatgct 
acacttccag 
atgtccttgg 
aagtatttat 
ctaggcctag 
cagcctttag 
tgattcagtc 
gtcttttctc 
tctaatctct 
ttctctcttc 
tgtgccgctc 
ctgtgctgca 
ctctcttctc 
tttgtctgcc 
tttaccttca 
tgagagtaaa 
tctccataaa 
ctgccatgtg 
aataagcatt 
cgactacatt 
agtgaatttt 
ttatattaag 
tttcttcata 
cttatggaaa 
cctcccatct 
tgaactcctc 
gagataaggg 
catgacccct 
ctagtgatgt 
agcctatgaa 
cgaccccaga 



nnnnnnnnnn 
tacagaggaa 
agaaagctat 
ggggactgaa 
tatgctaagc 
cctataacca 
caaaaacaca 
acaacaacaa 
atttatacta 
agcccctgca 
gtgtatttat 
atctttgaaa 
ctgagttgga 
ctgatctaat 
gaggcagaaa 
agaggctacc 
ttgactgatg 
ccctggaagc 
gacttgctgc 
cccctgttcc 
ttggcctctg 
tgtctccttc 
catccttctc 
tactctccct 
ctctcttctc 
ctgagggttg 
actcaattag 
ttgtttcaaa 
gatgtgtgct 
taacagaatc 
gtgaagagaa 
ttcttgggag 
ttgccatgcc 
aaggcagcca 
aattgggaac 
tacaacttag 
ctgaagaata 
agagattgtt 
accctagagt 
cctcctaaaa 
tagttacgta 
aaacttgtac 
aaatttggct 
gctctggtct 



nnnnnnnnnn 
acacaaatga 
agattaaacc 
cccaaggaag 
tgacaaaggc 
tgggtggagc 
cataaagatt 
aaagtactta 
ttctcgttct 
gaacaacgca 
gtaaaacata 
agaaccagca 
ggttcaatgc 
caatggattg 
caaggtgagg 
tcatccttga 
taactgtcat 
ttctagcttc 
tgagatcacc 
gggctcaaac 
aattgctctg 
acactctctt 
tgtaaagctc 
ctcagctcta 
cggtaccacc 
ggcagatcct 
acatcacttt 
ttaaaggtga 
aataaggctg 
ttagggttca 
tgcagagagt 
ttggattaga 
ttgagactgt 
aggctgtggt 
aaaaaagcag 
tttttatagt 
aaaattttta 
cccggaacac 
tcgaaccctc 
caacctcaaa 
gattcccttg 
tttccctgcc 
ggtcgtcgat 
atgttccatg 



gcaccccagg 
cccagggaca 
gggacagcac 
gtctctgagg 
cactcggaat 
aatgggtagc 
actgacagtt 
gcctgcaggg 
ggcactcccc 
tgggcacaaa 
gcctggctgg 
agactgccaa 
taagagagca 
cttggctttc 
tctgtggaaa 
ccagctcact 
tgaggctgtg 
ctcctcttcc 
nnnnnnnnnn 
nnnnnnnnnn 
gctgtaatga 
ccagcccttt 
gacagctgct 
tttctgaatg 
aattctctaa 
gtttactgaa 
atagctcatc 
atcacacgac 
cacaatgaaa 
cataaacaca 
gagtgctatg 
agagttcata 
accaactgat 
cctaactgga 
tccttccctt 
cttggaggct 
catacaatct 
gtttcctgtt 
tcctctcccc 
cttggtctca 
gcttgttctg 
tcccggtaaa 
ctgcactgct 
tgtgtctccc 
atcctgtcaa 
caagcaggag 
gtactaaggg 
agccaactct 
caatacgatc 
taaggatgtg 
gtcattcctg 
ggagggctga 
tggactgtta 
agtagaaagg 
tagcttttta 
ctgaactctt 
tcctgaactc 
ccaactaaaa 
atgaaccggg 
gcagaacccc 
cagctctccc 
tctcctctac 
tgctttcttg 



cagggacagc 
cccagcatcc 
gcaccccagg 
cccaggcaca 
cctcttatct 
caaacaccct 
taccattttg 
tctcctgaga 
gtgtgctgta 
caagctatga 
aatctctcca 
gcacctgaga 
gacatcttgg 
gatgctgaga 
tactctgtcc 
acatgaggcg 
gtgtgccctc 
ggtttccctc 
tcctcttcct 
nnnnnnnnnn 
nnnnnnnnna 
cagaaaagga 
ttgcctttgt 
tggaatgttg 
catatggtag 
gagtctgtct 
gcagtatttg 
agtaggaaaa 
tattaagaag 
ttgaatagag 
taaaaagaat 
gtttataaat 
ctcttaggaa 
agattaataa 
ggaagtttgt 
tctccgcttc 
tactggctcc 
aatccaagcc 
ctttctgaac 
gatgatttta 
aactaactct 
tctttactgt 
cctgcctcct 
ctccacagct 
ttacgtagct 
acctttctct 
tgcctcctct 
tgtgtctctt 
agctagaaat 
aaatatcctg 
ctcttgaggt 
tgacactgtg 
gcgatggatt 
ctagctacgt 
acagttttgg 
tgttgctatg 
cttcacccca 
ttcaccccag 
actgttccaa 
tacattgcca 
ttgtcccttg 
cccttgagtt 
accactaggt 
ctgttgttct 



accccaggcc 
caggtccagc 
cccagtatcc 
gtatcccaca 
tggtccttac 
gtgcaggttc 
gcttttgtct 
gcaatttggg 
ggggcaggga 
cccatgccag 
ataagggtag 
cctgagcagc 
ctttcacccc 
ggcacttaac 
tctgtcccag 
tatgagctct 
atgttactca 
catctgtccc 
cctcctcctc 
nnnnnnnnnn 
caaaggacgc 
atactatttc 
ttttcatatt 
gagacctctc 
tatctattca 
gaacttctta 
caacagtgaa 
agaacaacaa 
attaaggtgt 
aataaagaaa 
agaaaactct 
gaaaagttcc 
gtaactaaat 
tctgaaggca 
caccagaggc 
ctggctgtaa 
atcagctaac 
tagaatgttc 
tctagctggc 
ttcacaatct 
agcaatcttt 
gtctagtttg 
cctccccctc 
ctcctgtatc 
tccctttcct 
gattcttcac 
acaaaccaac 
tttcagccag 
agtttctttt 
agacagctgg 
ttcagatggg 
acaaaggact 
aatacactgg 
ttagctggat 
cagaaggagg 
accaaaatac 
gaacccgacc 
aatgctttcc 
gaacattttt 
aataatagga 
acagaacccc 
ttactatata 
gcatgagttt 
attaaatctt 



163620 
163680 
163740 
163800 
163860 
163920 
163980 
164040 
164100 
164160 
164220 
164280 
164340 
164400 
164460 
164520 
164580 
164640 
164700 
164760 
164820 
164880 
164940 
165000 
165060 
165120 
165180 
165240 
165300 
165360 
165420 
165480 
165540 
165600 
165660 
165720 
165780 
165840 
165900 
165960 
166020 
166080 
166140 
166200 
166260 
166320 
166380 
166440 
166500 
166560 
166620 
166680 
166740 
166800 
166860 
166920 
166980 
167040 
167100 
167160 
167220 
167280 
167340 
167400 
gccttctaca 
tgagtgcttg 
acagcgcgac 
ggtctgatgt 
gctcagtgag 
tgtccaccgt 
cctggtggac 
tcccacctcg 
agaccgtggg 
ctcgtgcaga 
tgtggtcgac 
ctcttgttgt 
tgactctgga 
gaaagggtcc 
cacctgaggg 
aggaaggggg 
tcccacctcc 
cccagtctga 
ttgacgacct 
ctgcagctgc 
ctcggagtcg 
tgccactcag 
agtactggcc 
cagagaaccc 
cttgggatga 
tcctcataga 
acctcgtgga 
gtaggggacg 
gacggcccac 
cagtcttcct 
cagaggggca 
agaaaaagct 
aagcagaaaa 
aagaaataga 
ttttggccgc 
acagggcagt 
attgcaaaga 
tcctaaccct 
taactttgtc 
cagtactcac 
ctggtagtaa 
tgacccactc 
taaccaaact 
aggcccctgt 
aacccaaaaa 
aagcaggaat 
ccacccccat 
ctcatatcca 
caccacttct 
gggaggttaa 
taagctctct 
tttgcctgcg 
agggcggaca 
ccaccctgtt 
agcttaccct 
gtcaccaggg 
ctaaaaaggc 
gcaaaagatg 
ccccacggca 
gctttgcaac 
ggaaagaaga 
cgctagcatt 
tagccagggg 
caaaaaaatt 



ttttgagtac 
agtgagggtc 
cacccagagg 
ttgtgttctg 
accgcgctcc 
ccgttcaccc 
ccctttggag 
tgcccagttg 
ttcaagtccc 
gggtctcaat 
gcagagtcgc 
gtctactgtt 
gcattggaag 
gtggcagacc 
tgcttttgac 
tcaccctgat 
gtgggtcaag 
tgcagccgga 
cctctggata 
cccaccacag 
ccgaggccga 
agcacatgtg 
tttttcctct 
ctctggactt 
ttgtcagcag 
ggcgagaaaa 
cgaggctttc 
cctccttgtc 
caatttggct 
tgagcgtcta 
gagagccgct 
gcaaaggctg 
agtctatcat 
ggagagggaa 
agttgtgaat 
gaaaccgcaa 
gaaaggacac 
agaagatgat 
cgtggagggg 
taacccccta 
attttacccc 
ctttctggtg 
aaaggctcaa 
tgcctgcctt 
tgcagtctct 
ggggttggct 
ttcggtaaaa 
gaggttgctg 
gccggttcaa 
caaaagggtc 
cccacctgag 
tttgcaccct 
gactggtcaa 
tgacgaggcc 
actacagtat 
aactgagagg 
acaaatctgt 
gctcacagag 
ggtacgtgag 
cctagcagca 
acaccaaaga 
accagacttg 
agtattgaca 
agatcctgtt 



ggtctcagtg 
tcccttcggg 
tcctagaccc 
tttctaagtt 
gagagggaac 
tgggagacgt 
gccaagagac 
cgagatcgtg 
acctcgcgtt 
cggccggcct 
cgccgtttct 
tttctagaaa 
gaggtgcggg 
ttttgcgcct 
ttgtcactaa 
cagatcccct 
ccttggaccc 
aagtctggcc 
gactcccaac 
ggaccaatag 
agccccgggg 
agagggccag 
tctgatttat 
actgggctcc 
cttttgcagg 
aatgttctgg 
cccttgaacc 
tatcgccgga 
aaggtaagag 
atggaggcat 
gtagccatgg 
gaggggctcc 
aagagggaaa 
aatagacggg 
gatagacagt 
ggtggcaaaa 
tgggctagag 
tagggaagtc 
actcccgtca 
ggcaagctag 
tggacgacca 
atacctgagt 
gtccaattta 
gtcctcaaca 
tcaggttggc 
aaacaagtgc 
caatacccca 
ggccaaggag 
aaaccaggga 
ctggacattc 
agaacatggt 
aagagtcagc 
ctaacttgga 
ctccatcggg 
gtagatgatc 
ctcctcacag 
caaactgagg 
gcccggaaaa 
tttctgggga 
cctctatatc 
gcttttgagg 
actaagcctt 
caagcactgg 
gctagtggat 



tcttcttggg 
ggtctttcat 
acttagaggt 
tggtgcgatc 
gcggggtgga 
cccaggaaaa 
catttggggt 
ggttcgagtc 
tggtcacgag 
tagaaaggcc 
ggtttctttt 
tgggacaatc 
tcagagccca 
ccgagtggcg 
tcgctgccgt 
acattgtgac 
caaactcttc 
catcagcacc 
ctccccctta 
cgagaggggc 
aggaaggggg 
caccaggacc 
ataattgaaa 
ttgagtcact 
ttctttttac 
gagaggacgg 
gccccaactg 
ctctagtggc 
aggtcttgca 
ataggagata 
ccttcattgg 
aagatcatac 
cagaagaaga 
atcgccgtca 
caggaaaagg 
agataccact 
attgccctaa 
ggggctcaga 
acttcctgat 
gctccaaaaa 
aacgagctct 
gccctgctcc 
cttcagaagg 
cagaaaaaga 
taactgcgtt 
ctccggttgt 
tgagcaagga 
ttttagtggc 
ccaatgacta 
accccacagt 
atacagtcct 
tcctgtttgc 
ctaggctacc 
atcttgcgcc 
tcttggtcgc 
aactgagtga 
taaccttcct 
agactgttat 
ctgctggctt 
ctttgactaa 
ctatcaagtc 
tcgtcctata 
gaccctgaaa 
ggcccacatg 



tccgcggctg 
tttggtgcat 
aaggttcttt 
gcagtttcgg 
taaggataga 
acaggggagg 
tgcgagatcg 
ccacctcgcg 
atcgtgggtt 
atctgattct 
ttgtcttagt 
tgtgtccact 
caaccagtcg 
aacgtttaga 
caggcgaatt 
ctggcagaat 
gaaactgacg 
ccccaagatc 
ccccctgccc 
tcagggaccg 
gccagactca 
taatgatctc 
aactaaccac 
tatgttctcc 
aacagaagaa 
cacacccact 
ggactacaac 
aggtctcaga 
ggggcagact 
cacccctttt 
tcagtccgct 
gctccaagat 
gaggcaggag 
ggagagaaat 
taaaataggg 
ggaaaaagac 
aaagcgggag 
ccccctccct 
agacaccgga 
gaccatggta 
tcagatagac 
cctcttgggg 
cccacaagta 
gtaccggttg 
ccccaatgtc 
ggtagaactt 
agctagaaaa 
ctgtcagtcc 
tcgcccggta 
cccgaacccg 
agacttaaaa 
ttttaaatgg 
acaggggttc 
ttttcgcgct 
ggcggcctcg 
cttggggtat 
ggggtatacc 
gatgatccca 
ttgtagactc 
ggaaggggtt 
gtctctaatg 
tgtggacgag 
aagacctgta 
tctgaaagct 



tcccggggct 
tggccgggaa 
gttctgtttt 
ttttgcggat 
cgtgtccagg 
accagggacg 
tgggtttgag 
ttttgttgcg 
cgagtcccac 
ttgagttgct 
ctcgtgtccg 
cccctttctc 
gtggaagtca 
gtaggctggc 
gtttttcagg 
ctcgtccaat 
gtcgcggttg 
tatccagaga 
caacagccac 
gcgggggaga 
acagttgcct 
attcctttac 
cctcccttct 
catcaaccca 
agaaaaagaa 
gccctcccta 
accgcggaag 
ggagccgcta 
gaaccaccct 
gaccccttgt 
cccgacatta 
ttagtaaaag 
agagagaaga 
ctgagtaaaa 
ctcctgggca 
caatgcgcct 
cgatccaagg 
gagcctaggg 
gcagaacatt 
attggagcca 
aaaaatatag 
cgcgatctgc 
agctggggaa 
catgaagaac 
tgggcagaac 
aaagctgatg 
ggcatccggc 
ccctggaata 
caagacctcc 
tacaatttat 
gatgccttct 
agggacccag 
aaaaattccc 
cgaaaccctc 
aaggagctgt 
cgagtttcgg 
ctccgagggg 
tcgccaacta 
tggattccag 
cctttcaagt 
actgccccca 
agagcgggtg 
gcctatttgt 
attgcagcag 



167460 
167520 
167580 
167640 
167700 
167760 
167820 
167880 
167940 
168000 
168060 
168120 
168180 
168240 
168300 
168360 
168420 
168480 
168540 
168600 
168660 
168720 
168780 
168840 
168900 
168960 
169020 
169080 
169140 
169200 
169260 
169320 
169380 
169440 
169500 
169560 
169620 
169680 
169740 
169800 
169860 
169920 
169980 
170040 
170100 
170160 
170220 
170280 
170340 
170400 
170460 
170520 
170580 
170640 
170700 
170760 
170820 
170880 
170940 
171000 
171060 
171120 
171180 
171240 
tagccctgct 
cccctcatgc 
cgaatgacac 
atcctcaacc 
atggacatcc 
agagctccca 
ggagctgcgg 
tcggcacaaa 
atcgttaata 
atctacaggc 
attctggccc 
ggccacccaa 
aaaacaagtt 
tgaggatggg 
tttttgaaag 
accaattcac 
aagtccccaa 
tcactaatgc 
gagtgtattg 
tgttagtatt 
ctgcccagat 
aggtaatcgg 
ctcagttggg 
tagagaagat 
gaaaagactg 
actccttttg 
gtttccgact 
gtgattagga 
gtaccccacg 
cttgagccac 
gttgacggaa 
gaagaaaacc 
ctgcgccgca 
gtcctgggag 
cctctggatt 
caattgggac 
cccaaatggg 
agctgcacat 
cgggggggca 
ctggaagccc 
gtcaaacgaa 
cagagcaccc 
cgagaacgga 
atacattcca 
ggcagtgaca 
tggctccccc 
taccctccct 
taggaacaga 
ctaaccctaa 
aaggaatagc 
gaaaaaacag 
tacctcagga 
gtcagtatct 
gtgtgtctat 
ccagactctt 
gaaaaagaga 
gagtaggtac 
gcagttatgg 
ttaacttccc 
aaaaaaagag 
ggagtaatca 
agaaaaagcc 
ctcctctcca 
tgcatcctta 



gatcaaagat 
cttagaaagt 
actatcagag 
cagctaccct 
tcgctgaaga 
gttggtacac 
tggtagacgg 
aggctgaact 
tctacactga 
agcgagggct 
tgttagaagc 
aaaggagaaa 
gctcaagggg 
aggtataaca 
aaaaatagaa 
ccacctggga 
cctgaagtca 
aactaaagcc 
ggaggtagac 
tgtagacacc 
tgtggccaag 
gtccgacaat 
catcgattgg 
aaataggacc 
ggtggctctc 
aagttctgta 
ctgaccctgt 
cccagatttg 
ggttccgagt 
ggtggaaggg 
tcgcctcctg 
acgacgacaa 
ggcgccactc 
gtgcttaatg 
tggtggcctg 
ctctcagatc 
atagggagca 
ttttatgttt 
tcagattact 
tcctctaaat 
ggagaaagaa 
tcaggaccat 
aaacaacacc 
ctaagagatc 
ctcgtagggc 
aaaggtcccg 
aggaactaat 
ggaccgtctg 
tatgactcaa 
tcagatcagg 
aaagttgact 
taaagggcac 
ggtgcctccc 
gtctgttttt 
gtatcatgat 
acccgttacc 
aggaaccgct 
atattgatct 
tgtccgaagt 
gactctgtgc 
aagattctat 
aacaaagatg 
ctatagcagg 
ataagttagt 



gctgacaaac 
atcgtgcgac 
cctgctgcta 
tctccctcta 
aactgggacc 
ggacggcagc 
gaaaaaggta 
tatagcgctt 
cagccgatat 
attgacctcg 
catacatgca 
acttggtggc 
ccatgatctt 
taaaagagct 
ttaactcccg 
gttgaaaaaa 
gtggctcaaa 
tacagagaac 
tttactgaag 
ttttcaggat 
aagatccttg 
ggaccagcct 
aaattacact 
ttaaaagaga 
cttcctcttg 
tggaggacct 
cttaccctcc 
ggaccaactg 
tggagacaaa 
accctatttg 
gatccacgcc 
ttggacagtg 
tagacctagg 
aaaaggaaaa 
atctcacgcc 
atactgatct 
catatgggtg 
gccctggtca 
tttgtggtaa 
gggacctaat 
acccctataa 
gcaaagataa 
gtctaagttg 
ctgggttcat 
ccaacaaggt 
actgtaccag 
actctcctca 
gtcagtctag 
tcatgctggt 
acttataata 
ctagcagcag 
ctctgtaatc 
ctagacacag 
aatagttcca 
aatagttctt 
ttaactttgg 
gccttaatta 
tagaactata 
ggtgctgcaa 
tgccttaaaa 
ggctaaactt 
gtttgaaagc 
acctttaatt 
agcttttatt 



tgacaatggg 
agccacctga 
aatgagcgtg 
acaaatgatt 
agaagtgacc 
agtttcctga 
atttgggcaa 
atacaagccc 
gcttttgcta 
gctggtaaag 
cctaaaaagg 
caagggcaac 
aactgaaaaa 
atggtagacc 
aagaaggaat 
tgatgagact 
agattataga 
ctggaaagag 
ttaaacctga 
gggttgaggc 
aagaaatcct 
ttgttgccca 
gtgcttaccg 
ccttgactaa 
cgctcaaaca 
ccccccttaa 
tctttgctta 
aaagcagcct 
gtcttggtca 
gtgttactga 
tcccacgtca 
gcagtcactg 
gaaccttaac 
cattgtatgg 
tgacatctgt 
tagcaaccca 
ttcggggcag 
gggtcagagc 
atggacatgt 
cacggtaaaa 
atatcaagag 
atactgtaac 
gcttaaagga 
tttcacgatc 
ccttataaaa 
ctccaccaac 
taaagcctac 
tccaaggagc 
tatgctatgc 
ctacttcaga 
tttcaggaag 
agacccagaa 
tgtgggcttg 
aagatttctg 
ttttagataa 
cagttctatt 
agaccccccc 
gaacagtcta 
aatagaaggg 
gaagaatgtt 
agagaacgcc 
tggtttaata 
acacttatgc 
agaaaaagga 



acagcaggtg 
cagataagat 

taacctttgc 
ccgtcccagt 
tgactgacca 
tagaggggaa 
gcgctttgcc 
tccgagaggc 
ccgcacac^t 
acattaaaaa 
tagccatcat 
cgaatggcag 
ggtgatccgc 
agtgatcccc 
aaaatttgta 
aattaaaaat 
ctcctgcaaa 
acaacgggga 
aatgtatggt 
atttcccact 
gccaagattt 
ggtaagtcag 
ccctcaaagc 
attagccatt 
cccctggtcg 
tggaagctgg 
ttcatttaaa 
ataccccagg 
gacggcatcg 
caacccctac 
agagggccgc 
acaatcctct 
cctcatgctc 
gcaaccactg 
aagttagcgg 
ccccctgagg 
ttctaccgag 
aaaaggcttc 
gaaacgacag 
cgaggtagtg 
agtgggtgcg 
cccctacgta 
aataggtggg 
agattgacag 
caggggcccc 
tccacagccc 
cttggcttcc 
ttttttagtt 
ctctagcccc 
tcattctcaa 
agggctttgt 
catccagtct 
caataccggt 
cattttggtt 
atttgaacat 
aggattggga 
aatactatga 
taaccaaatt 
aattagactt 
gtttttatgt 
tagatatacg 
agtccccttg 
ttttgcttac 
taaacgcagt 



accgttgtag 
gacaaatgcc 
gccccctgcc 
acatcaatgt 
accctggcct 
gcaaaaggct 
tgaaggaaca 
taaaggtaag 
ccatggggcc 
caaagaaaaa 
ccactgcccc 
acttagtggc 
ccaaaagccc 
tcccatactt 
aaaggactac 
tcccgatacc 
ccatgtgcat 
gaccatcctg 
aacaagtatc 
aaaacagaga 
gaaatcccta 
ggcttggcca 
tcaggacagg 
gagaccggca 
tttcgggctc 
tggaacatta 
ggccctaaaa 
gaccaccgca 
aaccggtagc 
tgcggtaaaa 
cagtcaagat 
taagcttcgt 
caattcaaca 
cagtccatcc 
caggatcccc 
agcggtgtgt 
ctaatcttag 
aacaaaaatg 
gagatgctta 
gctatgataa 
cttttaaaaa 
taaggttcac 
gttggcgagt 
tgagagaccc 
ccagtcgtac 
aacacagtgg 
ccaccgcccc 
ctaaatagaa 
ccttattata 
tgcctttggg 
ctgggccggg 
agcaaaagcg 
ctcactcctt 
cagcttattc 
cgggtccgct 
gtagcagctg 
agaactacgt 
agaagaatct 
attattcctt 
tgaccattca 
taaaagagaa 
gctcaccact 
ttttgggccc 
ccaggttatg 



171300 
171360 
171420 
171480 
171540 
171600 
171660 
171720 
171780 
171840 
171900 
171960 
172020 
172080 
172140 
172200 
172260 
172320 
172380 
172440 
172500 
172560 
172620 
172680 
172740 
172800 
172860 
172920 
172980 
173040 
173100 
173160 
173220 
173280 
173340 
173400 
173460 
173520 
173580 
173640 
173700 
173760 
173820 
173880 
173940 
174000 
174060 
174120 
174180 
174240 
174300 
174360 
174420 
174480 
174540 
174600 
174660 
174720 
174780 
174840 
174900 
174960 
175020 
175080 
gtactaaggc 
tatctcctaa 

cagaacccga 
agaatgcatt 
tagaacattt 
caaataatag 
tgacagaacc 
ttttactata 
gtgcatgagt 
ctattaaatc 
tgtcccgggg 
cagaggaaaa 
agacaagatg 
tgtgtggtgg 
tacctctgtc 
cagataggaa 
ctcaatagga 
gctggccagg 
ctctgtgagt 
aggagcctgt 
tttaatcccc 
attggagcta 
agcttatctc 
cctccacaca 
cagctgcaca 
tattagtttg 
ccttaaaaca 
aagagtactt 
agcaatagcc 
atgagcacca 
ttcagccttc 
aaggctaaac 
ttgtagttat 
ccagtcacct 
ataatggtat 
agccatcgca 
attataaatc 
ccatcctaga 
ggacattgag 
tgttggacta 
gggtgtgtgg 
atttcgaggc 
tgtttcaaca 
taatacatgt 
ttctgctagc 
ggtcctcctg 
ggttttggtc 
catctggcaa 
tgtgagattt 
ttttcattcc 
tgtcaatcca 
ctagggtggt 
agaaaatggc 
gaccttgctg 
ccttaggagt 
accaaatgtg 
aaaatagatt 
aactggacaa 
cataatgagt 
ctcatgtagt 
tgcacgctgg 
acagaatagc 
gaggaagagt 
gttaattgac 



aacaatatcg 
aagaagtggg 
cccctcccat 
cctgaactcc 
ttgagataag 
gacatgaccc 
ccctagtgat 
taagcctgta 
ttcgacccca 
ttgccttcta 
cttgagtgct 
atgtattctg 
gaacagctca 
acaggaaaca 
cgtcaggccc 
ccaagtatca 
gtgtgttcca 
agagctgcac 
tcaaggcaag 
ttcaaaacaa 
atgacctcaa 
gtgagatggc 
tggggacaca 
tatgcctccc 
tggtcatgca 
agatcgcata 
caaaaatcaa 
tgtagtaagg 
ttccgagtgc 
gggtacactg 
ctaactgctc 
tactgctcag 
gcagaattca 
cttaaaggtt 
tccaccctaa 
ccagcccatg 
tgttggtgag 
ctgaagacca 
cctcatggac 
cccagacaat 
cacacacctc 
ctggtctata 
aaaccaaaac 
gggtaccagg 
ctagtgaagc 
agctttcctt 
ttaagaggtt 
atgtcttgac 
tgctcctcac 
tgagaactgg 
tcacaccaga 
cactttgctg 
taacgactgc 
cagtctgaag 
attgcttgct 
aagaaatatg 
tattacagag 
atgttggtac 
tagtttgctt 
ctatggtctc 
caatgtaagc 
tctgtgatga 
ctcagtgact 
atggaaagat 



ggtccttcag 
gaatgaagaa 
ctagagattg 
tcaccctaga 
ggcctcctaa 
cttagttacg 
gtaaacttgt 
aaaaatttgg 
gagctctggt 
cattttgagt 
tgagtgaggg 
cctcatgggt 
acctgctggc 
gagagctgcc 
tacatcttaa 
aaccaccagt 
ggatgtggac 
acctttaatt 
cctgggctat 
acaaaaccaa 
atagccaaac 
tcaacatatg 
cataatggga 
acaaataaga 
tgcctataat 
gcaagcagta 
ttaagcaaca 
acaataccaa 
tcagggatgt 
ctctcagagc 
aggcattcag 
catggctgtg 
agagttatag 
ggatgcacca 
tccccaaaga 
gccttgaact 
gtattccttt 
gcagctctcc 
tcagccgcta 
actatgtaag 
tactcccagc 
gagtgaattc 
cgtaaattca 
ggttgaactc 
tgaggcaggt 
tgagagccta 
ctggacttgg 
aaaggagatc 
cactgcagga 
ggaaaactga 
cctaaccaac 
ggctcatcct 
taagaagcag 
tctttaaaat 
gtcacctatc 
gcatcaatac 
tgcaagggaa 
agttcatgaa 
ttttttttct 
aaactcacca 
atgtgccacc 
ttaacttcaa 
gggtgggcta 
caagtccatt 



gaggttgaaa 
taaaaatttt 

ttcccggaac 
gttcgaaccc 
aacaaccgca 
tagattccct 
actttccctg 
ctggtcgtcg 
ctatgttcca 
acggtctcag 
tctcccttcg 
tcagggggtt 
aggagggtgt 
tacagtctta 
aggatctaca 
ttgtagggga 
aaggagaaca 
ccatcactcg 
acaattctag 
accataaaaa 
agctctcctc 
aaagtccttc 
gaggaccaat 
aaatatatat 
ccagcactcc 
ggctagacag 
ataacagtaa 
aaatgttcct 
ataaatactt 
tgcccagaaa 
aggtcacttc 
gtacttggca 
catcatgaaa 
agtgcctttg 
cttctggcca 
cttgatggtc 
gctgataata 
aggagtcctc 
ctagattcgc 
ccaatcccat 
acgcaagagg 
caggccagcc 
ttctatcagc 
aaagtcttca 
acagccggtt 
caaagaactc 
atctgtagct 
acttgtgttt 
ctgcagatct 
ctttgtattt 
tgccaagagt 
catccttggc 
tctttccttc 
gtgtttgtta 
atattctatc 
ccactgcaaa 
aagaaaaaaa 
gaggttctac 
ctttccttcc 
ctgtaaatca 
aggcctggct 
tcatcaactt 
agggcatgct 
gtgagcagca 



actcgctcta 
tactgaactc 
actcctgaac 
tcccaactaa 
aaatgaaccg 
tggcagaacc 
cccagctctc 
attctcctct 
tgtgctttgt 
tgtcttcttg 

ggggtctttc 

tccctcagca 
gggaaaggac 
caggcctacc 
gtttattaaa 
taaaaataca 
cagttgttta 
taagagggaa 
attagccaga 
agcatttctg 
agtccaaacc 
ccaaaaatat 
ttctacaagt 
aataaaaaga 
agaggctgag 
ggctacatag 
caaccacaac 
ttaaggacag 
agaaaacttc 
gttgtttatc 
tgtagtagcc 
ttatcatttt 
gtttccacca 
gggtgataaa 
tctcataatg 
ctgtctcagc 
caagcaaatt 
aatgcagact 
aacctattca 
tttaatacac 
cagaggcagg 
agggctacac 
tctatttcct 
tgtttacgta 
ctttactgct 
tttttttctt 
gtcatatcac 
gctgcagtgt 
attctgcctt 
gggctttgaa 
tctgctgact 
ctgcagttta 
cagaaatttt 
tggtagaata 
aggaagcaga 
aagtgtaaat 
tcagccagtt 
aaaatggctg 
ctttcctttc 
ccttgaactt 
cacacatttg 
gacataacca 
cataagggat 
acacgccctg 



agattagagc 
ttcttcaccc 
tcttcacccc 
aaactgttcc 
ggtacattgc 
ccttgtcccc 
cccccttgag 
acaccactag 
tgctgttgtt 
ggtccgcggc 
aaaactactt 
aattcaggga 
aagtgttcat 
accactgacc 
agaacactac 
aggaacacat 
aaagcttaac 
gcaggttcat 
gctacatcgt 
aggctttggg 
aaactgcaaa 
tgacaactgt 
taccctctga 
aagaagtcta 
gcaggaggat 
tgtaaacctg 
aaaaacccaa 
ttctggaatc 
ccctggagaa 
ctggattcat 
aatgtctaaa 
gtgactggtt 
agttcctgat 
ttatattcaa 
taaaatgctg 
ctgtgtttgg 
cttcaagctt 
ggcccagctg 
gacaagccac 
atattcatct 
cagatctctg 
agagaaaacc 
tagagaattc 
gcaagtttcc 
ccttgcaaat 
taggtctcca 
agacattcaa 
cccttctggc 
tttagttgac 
tttgtccatt 
tttgttttct 
tccccaggaa 
agtctatcta 
ttttgagttg 
cgtcccattt 
aaataataaa 
tcagaattgt 

ggggtgggaa 

cttacaaggt 
ctgatccttc 
gtttttcaat 
agaatcgtct 
tatcctgatt 
aacagaagtc 



175140 
175200 
175260 
175320 
175380 
175440 
175500 
175560 
175620 
175680 
175740 
175800 
175860 
175920 
175980 
176040 
176100 
176160 
176220 
176280 
176340 
176400 
176460 
176520 
176580 
176640 
176700 
176760 
176820 
176880 
176940 
177000 
177060 
177120 
177180 
177240 
177300 
177360 
177420 
177480 
177540 
177600 
177660 
177720 
177780 
177840 
177900 
177960 
178020 
178080 
178140 
178200 
178260 
178320 
178380 
178440 
178500 
178560 
178620 
178680 
178740 
178800 
178860 
178920 
ttctgaagta 
cgtgtttatt 
aacatgaact 
tatgctaaga 
agctttggaa 
atctgatttt 
agactcagaa 
tgtcattgtt 
ccaggctaga 
tttgagacag 
ctggcctcga 
gtgccaccac 
tgttcctgcc 
tagtaaacca 
cccaaccatt 
tttttcagct 
ggttctgatt 
tgcggtaagc 
ccaggtttgt 
cattggggca 
gatttggcaa 
caacacaaca 
aagagcaccg 
tcacaaccat 
tgtactcaca 
atgtacctca 
cattttctta 
cttgacacag 
ctctggtggc 
agaagcaaag 
tacatgcatg 
tcattaaaaa 
gctttgaaag 
aaccttttga 
taatggcagc 
tagtgttttt 
acctttgtgt 
ggcttacaca 
ccagctcact 
gggttcatgg 
ccatccttag 
ttagtttgat 
aacgagtttt 
agttatgatc 
tattgcctga 
ttgactggtt 
cctccagact 
tgtgatcatg 
gggtggaaag 
agaaatgtca 
tttatatagg 
aactctaagc 
acccttcaag 
tacgcacaag 
acacaactta 
acatgggaag 
nnnnnnnnnn 
nagagagaga 
gctgagtcag 
cgctgtccca 
gctgcctcat 
cagctgggcc 
gtgctttttt 
aatccttgca 



taagaggaga 
ctctgtctgt 

gcaccctgga 
tattttatcg 
ctgaacctaa 
gatggtaaaa 
aagaagttag 
ctgaatgtaa 
atctgaagga 
ggtttctctg 
actaagaaat 
accaggctag 
aatgtaccaa 
gaataactta 
tttggaattc 
tttttcaagt 
agcatttcaa 
agccacacac 
ttctctatcc 
cgtgcaactc 
tgtcaccctc 
cctctcatct 
actgctcttc 
ctgtaatggg 
tatattaaat 
gagtccaaat 
ttgaggttac 
cagacaagca 
acttcctgcc 
gacctactct 
cacatggtac 
gaaaaatcag 
ccaagaagtg 
caaaactaaa 
agtgggatta 
gtgtttattc 
gggtcctaga 
gtaggtggtc 
ggccttcttt 
ccagtgctct 
ctgtggtcct 
catctctctt 
ctacttctgt 
ctcacagtcc 
ccctgctcgg 
ttgttccaat 
gcagggatta 
tggtgctcag 
attccaagaa 
gggacatttc 
ttaacctgaa 
aactaaggaa 
ccccacccaa 
taacatcata 
aaaagagagg 
gttggaggca 
nnnnnnnnnn 
gcctattatg 
aactagccta 
gatttgcctt 
gctaatcagt 
agagccacta 
ttttttttta 
ataggctgca 



aagcttgatg 
tcttgaccgt 
attgcaaggc 
cagcaataga 
ggctgttata 
tcatgtgttt 
gacttgaatt 
gtgcctcagc 
actcctctat 
tgtagccctg 
ctgcctgcct 
gaactcgtct 
atagtcaatt 
tctagccaaa 
tgtgagggga 
tgcaggtcat 
aatgaggcaa 
tcttgcttgg 
taggcctctc 
aaacacggag 
ctcccttctc 
caagaattgc 
tgaaggttct 
atctgattac 
aaataaatct 
gcttcttcct 
tgcttctctg 
gttctttaaa 
atgactgatg 
ccaaagccgt 
acacacatac 
ggctcagcaa 
gatattaatt 
aaatataaca 
gtttcttttt 
atattccaca 
gattgaactc 
aggcttgtat 
gtttaaaaaa 
ttatcttaaa 
aggagaaaaa 
tgaagatata 
tcaacaaact 
ggtgggtcag 
tgcatccaca 
ttttagtata 
atgttcttga 
cactcttgca 
ccagaggaag 
tagacctctg 
aggggaaaaa 
tgttgggggg 
gctggttatc 
tggatgggca 
ccatgaattt 
gnnnnnnnnn 
nnnnnnnnnn 
tcgttggttg 
gctgagacac 

tgggggccct 
tgaggttgtg 
a9999dgtct 
aatgttccct 
agacaggcaa 



agagcaagca 
agatgtgatg 
aaacaaacct 
aatgaaactt 
attcactagg 
gtttctggat 
ccgttttaga 
tgaaaataca 
tttttgtttg 
actgtcctgg 
ctgcttccca 
attacacatt 
gattcctgtt 
gtctgcctat 
atccacagat 
gattcagtgg 
gcagagggca 
aggcttagtc 
tcttaaattc 
tgtgactcct 
cctagggtca 
cattagggct 
gagttcaaat 
ctcttctggt 
aaaaaaaaaa 
cccctgacta 
cttctaccct 
gcagggctca 
atctaaggtt 
cctctgacca 
acagaagtaa 
actttccgtg 
atagttattc 
gagtgtattt 
tagattattg 
gcatgtgtgt 
aagtcatcaa 
ctttggaagg 
agaaaaaaaa 
atcaactgca 
catacagttg 
atcagctcac 
acccaagctg 
aagcctaagc 
ttcacatcct 
tgtgctgctg 
ctgccacaga 
cccaaggctc 
aagaaagtca 
aagtctcaag 
attcttacag 
ggggtagtct 
caaaacaaac 
gattgcattt 
aagagagagc 
nnnnnnnnnn 
nnnnnnnnnn 
cttctaatca 
tgtccaccca 
tgaaatgaaa 
aaaatagccc 
ggtcctttgg 
gctttaggct 
gaatatcatc 



ccaggatcca 

cttcctggga 

caagttgctt 

cataactgcc 

ctggaagtga 

tatcaatttg 

agaaaactga 

ggaagaaagc 

tggttggttt 

gtagaccagg 

ttaaaggcgt 

tttaattaac 

atgtttctgt 

ttcatcagtt 

gctttagaca 

taatttagtg 

gcacagcaca 

tctaaacgcc 

tagacattac 

gcggttccca 

tcacactcca 

ctcagaggtt 

acatggtggc 

acagctacag 

ttgccattaa 

ggcctgagtc 

gctgcctatc 

gactgatcgg 

cccacgaggt 

actgcacatg 

aaattgaaaa 

gagtacttag 

agatttctaa 

gtaagtttac 

tattaattat 

ttctgcttcc 

taggtggtca 

acttcctgtg 

tgtttaatta 

tggtaaaaag 

tcctgcaggc 

agcctctgcc 

aacaacagcc 

acctcgctgc 

gtgtggttct 

tttgcctctg 

ttactgaaca 

gaggaagagg 

tgcatctcct 

gaaagtctta 

ttagacaaag 

acactcctct 

tgaagccata 

acatacacac 

gggaaggggt 

nnnnnnnnnn 



ggcaagcaag 
gctgtcttgg 
tttcctcttc 
agaacaggcc 
atagggacca 
atgatagatt 
gacattccag 
aagagtacag 
cttgtttgtt 
aactcacttt 
agtgctggga 
aacacccctc 
tatttaccac 
tagccatatt 
gctgtagacc 
gtcatgaaat 
tattgtcaca 
agtttctggc 
caaacatagt 
ttccccatct 
gttttacctc 
ggtgagatgg 
cccagcaacc 
gtgtctgaag 
aaaaaaagaa 
cactcacgct 
ggctccttct 
ggaccagtga 
aagcctagaa 
ccatgtgtaa 
aaagagattt 
tagaaaacta 
attatagcag 
tttttgtaat 
ttattatttt 
ggaattggat 
gcttgcacag 
caagcatttt 
agtccttttt 
aacttttatc 
gatggcttta 
atcacactca 
agcagctcca 
gggcgtggct 
ttcctggtga 
aaacaatctt 
gcactaatat 
ggggaacatg 
gaggtgagac 
aacaaggcct 
gggtccaacg 
tccccaggga 
tggtcagtcc 
aggaatacac 
aaagcaaagt 
nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn 



ttagaaaacc 
ctgtcccaga 
gtcaccagca 
tgcagtggtt 
agcagagtta 
cagtgctgtg 
cctgttttgc 



actctcttag 
gcaaggccat 
ggctccggaa 
cctgggcctg 
acagtcatca 
cgctacttct 
cctcaggcaa 



178980 
179040 
179100 
179160 
179220 
179280 
179340 
179400 
179460 
179520 
179580 
179640 
179700 
179760 
179820 
179880 
179940 
180000 
180060 
180120 
180180 
180240 
180300 
180360 
180420 
180480 
180540 
180600 
180660 
180720 
180780 
180840 
180900 
180960 
181020 
181080 
181140 
181200 
181260 
181320 
181380 
181440 
181500 
181560 
181620 
181680 
181740 
181800 
181860 
181920 
181980 
182040 
182100 
182160 
182220 
182280 
182340 
182400 
182460 
182520 
182580 
182640 
182700 
182760 
attctgaagt 

attctcactc 

ccacctccca 

gagaatgaag 

tgaacataaa 

gagaaatgtc 

ccccaaacag 

actcatgttc 

gcaccaggag 

tcccacagtg 

cctgggcttg 

gggacgtaga 

aatgccagca 

acagactgag 

agatagtggg 

acaaaaatgg 

tttgtcacct 

gttcccagca 

ctacaaagta 

caatcaatcc 

cttccagagg 

ctccaatccc 

agcatacaat 

gtacagaaag 

tttcctgtct 

tggtgtagtc 

aacaggatca 

ccccggctgt 

caactgagga 

tctggaattt 

gcagaactgc 

gggtctgtgc 

gaaaaaaata 

gaggcaggac 

cctctgatca 

tccattagag 

agtatggtag 

agaagttcag 

gttttgtctt 

tttgctgcca 

atgacgcttg 

actaagtgga 

tcgttgggaa 

gaaagagcca 

ccaaagctgc 

cacaggaggc 

atgaaacgtc 

gggctggcag 

tgcttttcca 

gctggacttt 

ctggctctgc 

gagaggtttg 

ctcaaaggca 

ccaagcccta 

agggggctgg 

attccgttcc 

gaatctgttg 

cgcaggcaaa 

tcatgccagc 

cagcctgagc 

aaaacaaaaa 

caatggatcc 

ttctccatag 

agagttgagg 



ctggcaaatg 

tacttctgcc 

tgctcagggg 

tagaagccag 

gttcagttac 

aagtgctaca 

atacaaacct 

cagcccagtt 

gaaatataat 

actgtggata 

agatctagca 

ggcagaagga 

tttgagaggc 

ttccaagaca 

agagaattca 

agcatgaatg 

acataccaat 

tctgggaggt 

agttctagga 

atcagtcaat 

acctgggttt 

aagggacatg 

atatatatgg 

agatcaacta 

gacatctgac 

cctgtgaccc 

aatcccacag 

ctgtgcagca 

gtgcctcagt 

gtcatttgat 

agaggaaaag 

tttctcttac 

gatgcatcct 

tcatcatgca 

aatcagtttc 

gcctgcatgc 

tatctgttta 

agttatcctt 

aacaaaacga 

agtttgatga 

aactccacgc 

taaatgttaa 

gggaggggaa 

gtgtgccacc 

ccagtgtgcc 

cagcaggagc 

tttccctttc 

ggctcatcca 

aggtcttagt 

actctgctgt 

caaccaccat 

gtctggtcct 

agtgctggtc 

gaggacagac 

agaaatagct 

taacattttc 

ccctctctgg 

acactcatac 

gatgctctca 

tatagaatga 

caaatctgtg 

ctagagctct 

accacatttc 

tttacatggt 



aaagatgtgg 

tgtgccacct 

ctaggaagtg 

cttcaaatct 

atttggttta 

attcagtgga 

tcatcaaagt 

ggctccttct 

agatgtcctt 

cagcccagtt 

ccctaaggca 

tcagttcaag 

agaaacagat 

gccaaggcta 

aggttatctt 

cttgcaactg 

tatacaaatc 

agagatgggc 

cagccaaccc 

catgggctgg 

gattcccagc 

acaacttcta 

gtaaaatgct 

taatgaaaca 

ttcttgcgtg 

aggattaggc 

tgctccactg 

ccgcccagat 

gctcctcagg 

atttccagac 

gggcactact 

aaatcattgc 

cacagtacag 

gcctctttcc 

aggcctggaa 

acgaagctct 

gcatggtgtg 

gcacttacag 

aacaagaggg 

cctgagtttg 

acatgccaca 

aaataaataa 

aaggcaggcg 

aaagctgccc 

accaaagctg 

agccaggttc 

ttcagaccac 

ctctagtgtg 

gtctaattaa 

aaggactttg 

gtcagtgggt 

tgtctaatgc 

cagcatgtgt 

atgctttaaa 

cagcagttaa 

atggtggctc 

ttttctcagg 

atatgaaaca 

gcactcagat 

gatgctgtct 

ggcttaatca 

gctttgcccc 

tattttgggg 

catttttttt 



gatttgaaca 

tcctctcatg 

gggagaagat 

aaactaagca 

aaaaaaaaaa 

tgtggatgaa 

ctcttccaaa 

catgtgagct 

tttaaggggg 

agtagagttc 

tagtggtaca 

gtcagatgag 

gaatttgtga 

cacagagaaa 

ggactgcata 

tggacaatat 

agattcagct 

agagctctat 

tacagagaga 

agagatagct 

acccacatgg 

ctggtctctt 

atatatataa 

accataacac 

acccagtgcc 

aaccactgcc 

taccccgcca 

gcttggtgtc 

gcagtgtgca 

tccagctgac 

gtgactgtta 

ggcctctctt 

gatggcaggt 

ctcaactaca 

agaaacggcc 

ggatttgatc 

gaggtatacc 

taagttcaaa 

gccggggaga 

gtcccttgag 

gcccacgcat 

atcaaaggaa 

agggagatag 

agtgtgccac 

cccagacttg 

cacctcagag 

agcagtgaca 

cctgtggcca 

ggttagcagc 

ggcattgttc 

cttcagaggt 

aaaatgcctg 

ggaatcctaa 

aaaaagtcat 

gagcactagc 

aaagctgtct 

caccaagaac 

ttttttataa 

ggcagaggca 

caaaaagaaa 

ttcctagcca 

ctggtctgga 

accactctgg 

gttgttacag 



cagacttgtt tggccaaaga 



cacggggagg 
ggatgtcctc 
atgttttatt 
tccctacaca 
acaatcaaaa 
ggctgggtct 
ccgacttcca 
gtggggtctg 
tcgcctagca 
tgcccatgac 
gggtggggag 
gttcaaggcc 
ccctgtcttg 
agactttgat 
tgggttcata 
gggcccactg 
gactttaagg 
tacactactt 
cagtgatcaa 
cagctcacag 
tggtcaacag 
aaatcagatt 
atactgtttt 
tccacaatga 
cctgggaagc 
gaggacgtga 
actgagtgac 
attgaaactt 
cttggataac 
tcccctgcat 
cacttccctc 
acgaggcggg 
ccgccgcccc 
actcaggctg 
ttcagcacgg 
tatctctgca 
gctagcttgg 
tgcttcgctg 
cttatggtgg 
gaatgtgtat 
tggccactca 
ataaccctga 
caaagctgcc 
attacagatt 
gtggagccac 
gctgtcctgc 
gaacaggcct 
caaattggag 
cattccgtga 
agaagaactc 
gggcaccagg 
gttcaatacc 
gctttaaaaa 
tgctctttca 
ataattcaag 
acatgtggtg 
acctgctcgg 
gggggatttt 
aaaaaaacaa 
gaaggagctg 
gagcttgctc 
gcctgcacat 
taatgagagc 



ggaggggagc 

aaagcagggt 

tccatttccc 

actggttctt 

tgttgaacac 

gaaaagagcg 

aagactgctt 

tctgacaacc 

agcgtgcggc 

cacagcactt 

gcattccttt 

agcctggtct 

tcaggaaaaa 

tccaaaataa 

catattctgt 

gtgcatgttt 

ctagcctggt 

ctaaatcaat 

tagtgccatt 

atgtctgtaa 

gcatgcacgc 

cacaaatcaa 

aaaagctacc 

gacaagcaga 

cattcctagt 

agcctccctt 

catcggagcc 

gcatgtgatt 

agcacagagg 

gattatagaa 

ctttgaagtc 

ttccgggact 

tgcagagttc 

gggatgtggt 

gcataagccc 

cagggagacc 

gcaacatgaa 

tgctgagtca 

aaggagagaa 

acacacacac 

aaatctacca 

tatgaacacg 

cagtgtgcca 

tggccaggga 

aaacctggaa 

agagtctgga 

cagtcacagg 

agagaagggt 

tcaaatacca 

atcctttttt 

ttaatgtcaa 

catcagagcc 

aattctgttg 

gaggacccag 

tcctagaggg 

caaacataca 

atgtggtggc 

gtgttgagcc 

aaaaaaacaa 

gctccagcaa 

tagaaaggaa 

ggaaaaatgg 

tttgaaatga 



182820 

182880 

182940 

183000 

183060 

183120 

183180 

183240 

183300 

183360 

183420 

183480 

183540 

183600 

183660 

183720 

183780 

183840 

183900 

183960 

184020 

184080 

184140 

184200 

184260 

184320 

184380 

184440 

184500 

184560 

184620 

184680 

184740 

184800 

184860 

184920 

184980 

185040 

185100 

185160 

185220 

185280 

185340 

185400 

185460 

185520 

185580 

185640 

185700 

185760 

185820 

185880 

185940 

186000 

186060 

186120 

186180 

186240 

186300 

186360 

186420 

186480 

186540 

186600 
tcacaaaagg

aaccgaatcc 

ttatggcaac 

tgcaaagtct 

gtcataaata 

ctaatgagga 

ttctaatata 

taaaatgata 

atccaaagca 

tctcatccgc 

taggacagac 

tattaggctt 

aaaaagcaac 

ttgcataatg 

gaacccggca 

ttgcagggag 

gacctgactt 

ggattgctct 

agcatggatc 

tgcataccag 

aaccacccaa 

ggattcataa 

tacccaaaat 

tttggaaaac 

gcaccaagac 

gccgctcacc 

cacagtcgca 

tttgtgaaac 

gtaattcaaa 

tgcctactga 

taacttccag 

gacccccatc 

agggtcttgt 

gcattattta 

catttcctct 

tatccttgat 

tttctgggtt 

accaacatga 

acaggcatgc 

cagagtaatt 

aatgagttct 

gggtgatgtg 

tggggagctg 

ggtacactcc 

catggcgggt 

aagtttgaca 

ccacaccacc 

caacaagaga 

ttgtgaggaa 

ttagtcggtg 

gaacaccagc 

ccgaattcca 

tggcgctcag 

tatcccctgt 

ctcccctcct 

gaccccctcc 

tttatcaagt 

gtttctattt 

actgagggtc 

accacgcacc 

ctcgcccaaa 

aaaagctacc 

acagggcggg 

tcaatattta 



aaaataggaa 

taaaaggaat 

tgacatttaa 

tgcaggaact 

tagcttattc 

aatatttcca 

atttatcaat 

aatagcaata 

gtgtagctat 

agatacagtc 

gaaggggggc 

tgttataaga 

tacttggggt 

aacaattttc 

caggctcata 

agaaagcagt 

taaacagttt 

caaagactct 

tccggccctt 

gaggactgaa 

actcatttcc 

aataaggcag 

ggaactgcat 

tggcacagct 

ccagggctga 

aatcccggct 

ctatgctctc 

tgttttcgaa 

actaattacc 

ttgtgcgctg 

gctaaatcag 

cctgacttcc 

agttctggct 

atttaggcta 

tcttcaattc 

cgtcccaccc 

tgcaaggtgg 

gcggcgcaca 

acacaccggc 

aggcctaggc 

cttcctttgg 

gctgaaacca 

cagtcggaag 

cagaaataga 

ttttagacat 

gagtgatatt 

gccacccaag 

gggagccggt 

ttgagtgtac 

aagcaggggt 

tcgaggaaca 

cagggcttaa 

caatctcatt 

atctgactca 

cctgcctatg 

actggactct 

gttctgttgg 

cctttacatt 

ctggtagttt 

aagcaaaaat 

gccagattta 

acttggcaga 

atggggcggg 

acctaactgg 



aatatgcctc 

gttcaagtaa 

aagtaacagg 

tcatgctaaa 

ccaaatcctt 

caaaaaccca 

ttaaacacat 

gtggtgaggg 

atttaaatac 

tagactcaga 

aataagaaat 

ttacatcaaa 

ttgcgcctgg 

attcaatcag 

gtttttatct 

cgggaggcat 

ctctctggag 

cggactcctg 

ctcactgctt 

gtttggctcc 

gacttggttg 

aggcagagaa 

tttcatgcac 

gacattggct 

ggcctaatcc 

ggagccgcag 

ccgtggggag 

aacctgggat 

ctgagcacat 

cagctcactc 

gcagccacag 

aggcgaggag 

gccgaattat 

gaatgacctc 

gtggccccca 

cacttggcta 

tcaggtccac 

gccgccatca 

agacatgtgc 

agttcctgaa 

gggtgctaaa 

tttctgcagc 

aggcacaatt 

cagatgaggg 

ttaggaatat 

accgggttta 

ttatcacatg 

ctatctttgt 

aacactccaa 

aactgcgctc 

ccaaaaaggc 

gtattaagcc 

tattatttct 

cattctcctg 

catgctttgc 

ctcaccccac 

tgagccttct 

tcttgtagcg 

tagagttgga 

tgcacccacc 

ttgctcctgc 

agatcaagtc 

tagggtggag 

taatttgctg 



ctaaaaagag 
aaaaaaaaaa 
atttgggcaa 
attatgcttt 
agaatttcta 
cttaccatta 
tgcataatgt 
gtggggggca 
ctcaatccat 
gcagcatatc 
gacaggaaac 
gcagttcata 
gggctgcctc 
gatctcagca 
cccagctcca 
tgtcctagtg 
gttgaaaggg 
ttacaggcaa 
tcttgaatca 
cggtgaccag 
gtcaaatttt 
acggagggtg 
aatagaaaac 
agaggaagga 
gcctttatcc 
aagagctctc 
gccgctccgg 
gatcatttaa 
ttgaaacatt 
tggtgtttct 
gcgctgcctc 
gcggcccgtt 
tgctcttatc 
tttccctccc 
ttttctgatt 
catcttcatc 
cagtcatcct 
ctgagagagc 
acatgcgctt 
gcaaattcat 
ccagcatgcc 
caagcctgtg 
ctgggatcaa 
ctgcctcctt 
aaaagtaggt 
actagagcca 
agccataatg 
ccaaaggaaa 
taacatcccc 
gagcagtctg 
cgattaatga 
ccaaagaaat
ctacaaagat 
ctcagtaagc 
gtcttcacaa 
atacttggaa 
cttgcattaa 
tctcacatgg 
gttagatgac 
cttccctcag 
tgtaaagtgt 
agaagtgcag 
cgaataattg 
tgacaattac 



ccgaggcaaa 

atatggctca 

gtttctttct 

aattttaatt 

ccctgcaagg 

agaaccccca 

gccactctgt 

aaaaccagga 

tgtagaggaa 

cttgagtgta

ttcagaagaa 

tgttttaatc 

tgtgtactga 

gagatagctc 

cctgctggag 

gctgtgtacc 

gctctgtaag 

gtaaggtcct 

gggatttaga 

aggacaaggt 

caagtttccc 

tgtgtgtgtg 

ttaaagactg 

acggccaggg 

gagggtttag 

ttcacttggc 

gagggggagc 

atgtttaaaa 

tatgccatca 

ataaactgct 

cagccctggg 

tctccagaga 

cgtgttcata 

gagtcttcct 

ggtccaaata 

tgggagccaa 

aaggtgtgtg 

acgtgccctg 

tccccagcaa 

ttcccccttt 

agtggctaga 

ggcagaagct 

gaaatgagca 

attagcgctt 

tggattccca 

ttaagagact 

caagagaatt 

tgagcagccc 

tgcaggattg 

cctgtgtacc 

caaaggacac 

caaggtctag 

ccaacactca 

catcctggtt 

cgacagctgg 

ctactccttg 

agctgtgaga 

gagacaccca 

ccagcaacat 

atgttcctgg 

atcttctcta 

gctagcaggt 

aagctccaag 

gccatgaagg 



ttattatagc 
tgcgaagttg 
ccaccctctc 
agccaaatag 
agctgaacta 
tccgatattt 
agcattccat 
gattaaacat 
aacacactgt 
agggtattaa 
ataaaatttc 
tggggaggaa 
accagacagt 
ctactcaaag 
aaaccttgta 
taaagttaca 
ataccagagt 
agcagatggt 
aattgctatt 
cattgtttaa 
agcagtctaa 
tgtgtgtgtg 
aaccaatcat 
cgagccagct 
tgaggctccc 
tcagtcccag 
gacatcaagc 
tatgcacatg 
tcttggatcc 
tcagcgattt 
ttggtggaga 
gccgtttgtc 
attctcatct 
ccctcattcc 
tatagacaaa 
tgtggtgagt 
agagaggtag 
cagctcaggc 
accctgcttg 
tccagaataa 
agcctgagat 
aaccttgggc 
ctggtttata 
tgaagatgcc 
cagtcagctg 
cttcattatc 
ttcattccat 
agcgtgaagc 
cctctgcgat 
tggcttgcaa 
tcatagaggc 
gccattctcc 
atttcccagg 
tgaaacgggc 
taatttgcaa 
gaactacttg 
aggaaccaca 
ggttagatat 
gccttccccc 
catcttataa 
agcctcactt 
gacggtgagg 
agttaccagc 
gaacgctgcg 



186660 
186720 
186780 
186840 
186900 
186960 
187020 
187080 
187140 
187200 
187260 
187320 
187380 
187440 
187500 
187560 
187620 
187680 
187740 
187800 
187860 
187920 
187980 
188040 
188100 
188160 
188220 
188280 
188340 
188400 
188460 
188520 
188580 
188640 
188700 
188760 
188820 
188880 
188940 
189000 
189060 
189120 
189180 
189240 
189300 
189360 
189420 
189480
189540 
189600 
189660 
189720 
189780 
189840 
189900 
189960 
190020 
190080 
190140 
190200 
190260 
190320 
190380 
190440 
actatgcaag
ttaataacac 
cagcatatgg 
ttcaattcgt 
catttgactg 
gcttcccaca 
aactttaata 
gagcgtgaga 
tgagacaaaa 
atgggagttt 
ttagtggttc 
tatggagatg 
gtagaagtgg 
tgcaccccaa 
aaattttagg 
tcgagttgag 
ttttttttta 
tggctccctg 
aaccaccgaa 
tacaagctct 
aattaccgac 
ttaaaagttt 
attagggact 
aattatacat 
aggttatgaa 
cagctccaag 
ggatcactta 
ccatcttccc 
gggtctctcc 
tttttttttt 
aactcacttt 
ctcctgagtg 
tgaagttccc 
tccggctcag 
aacccccaca 
gcaaggctac 
tgtcggaggg 
tgtcctttgc 
gccttggtgg 
atgcctccct 
atttacctgc 
agagaccctc 
gcccatcaga 
gtgattcagc 
gccactttta 
gggtatataa 
cctcacgcct 
actaagtgtg 
ggtgcctggt 
gttatgtcac 
tgctctcgta 
gaactgtaag 
aaagtaactt 
ttgtgtctgt 
ccactgacat 
gtcattcaga 
cctagggtac 
ttgtggatta 
acatcaacag 
cttgaacaaa 
tccggccttt 
ttatttcatt 
ctaaggttac 
tgagatgaag 



aatgttgctc 
acagaatgag 
ctgtgcacat 
cctcagatag 
acaggcatct 
gcccggcttt 
aggattttct 
caggaatggg 
gaatattaat 
tgaggcatgg 
atttatggag 
tagcttataa 
tgggggtggg 
gcctatgttt 
ccaatagaaa 
cagtggtgtc 
aagtccactt 
cagcgagagg 
agggaactgc 
gagcaagcta 
tgggtaatcc 
acacaatcac 
ttttgggtgg 
tgagatttgt 
atggccaggt 
tcataagctc 
ggctacccac 
aacatggctt 
aatgtggact 
tntttggttt 
gtagaccagg 
ctggaattaa 
agtgacctgg 
aattttgggg 
ttgggagagg 
acagcaagaa 
tgtgaggcct 
tccattactt 
ctgcaatctg 
ccccaccctc 
cagcctttgg 
acagaagcac 
agaacccagg 
cccataaatt 
ttgaaataga 
gagataatcc 
gcctgcaagg 
gaggtatgga 
tggtggcacc 
tgggtatggg 
aggttcctgc 
ccacttactc 
gtgtggcagc 
ctgtctgtct 
gtctcagtcc 
ctggttttga 
tcgaccagac 
tttgtgtata 
tatttagcat 
tgaaaatcaa 
taatacagtt 
gcttaaccag 
ccaaccacgc 
aacaatcaac 



tctaattaag 
ccttggctcc 
aggccgtgag 
agcccagaga 
tcccggaaag 
caaccaggaa 
cacagaaaga 
gacacaagcc 
atcttggcta 
ctaagctggg 
ggcttgctgc 
aactactcag 
gtggggcaag 
agggttgatc 
tgacctctgc 
aacggagaga 
gcttgggact 
ctccagtttt 
acagcacaca 
tctgaagaag 
actagggagc 
ttttgagttg 
tttactcgag 
caactgctgg 
aaggttgggt 
aaacaagtct 
ggacaagcac
ctatgttgct 
caaatcatga 
ttcgagacag 
ctggcctcgg 
aggtgtatgc 
atgtcaactg 
ggacaaggta 
cagaggcaga 
gctgtctcaa 
tgggggccag 
aatcagaatc 
catttaagag 
cctgcatccc 
tgggacacag 
ctgaatgtac 
cttatgctaa 
tacacatccg 
aaagatgcct 
catgttgtct 
tagccaccaa 
taaaaaaaaa 
gtttggggag 
ctttgagagt 
caccatgttt 
tcaggtctct 
cagctaagca 
gtcattccat 
tcttggtatc 
gcttgcaatc 
tgggagtagc 
aagatctgat 
caacttaata 
acagcagtac 
cctcgtgttg 
agttaactgg 
taggaaggag 
ataacaggga 



agggctctgc 
gggagctaaa 
tgatgcagcc 
gcgcggctca 
cctgcgcgtg 
ggcttggcgt 
aaagtccatg 
aacaccccaa 
tgaaggagga 
caaaccattt 
ccggagagcc 
attttaaaca 
agaacaattg 
agcttcccga 
accacggctg 
gcagaggaaa 
cacctgaagg 
cccaggcacc 
agttaaatat 
ctgtcatttt 
aggtagtttt 
actataagta 
gctgcaacca 
ggagtagtgg 
gcttccaagt 
tcaaggggcc 
ttctcataca 
tcaacaacca
ctacagcntg 
ggtttctcca 
actcagaaac 
taccacgccc 
aagttggatt 
ccttgattta 
ggcagaggca 
aaccaaagac 
aacaaggtgg 
gccatcacag 
ctaagtggga 
tccatttacc 
gcttagtggc 
tttcagcgct 
ggagccagaa 
tacagccaaa 
attctggagt 
ttgatgtggt 
gccacaccca 
tgtcccaaga 
gcagtggacc 
ttgtagcttt 
cctctgccat 
ttcagtcctg 
agggctgtgg 
ttatatagtc 
atatattcac 
ctcctgcctc 
aggttctgtt 
ggcccgaccg 
aactcatttg 
ctgttctcct 
cggtgacccc 
aagggttaat 
aggaaagggc 
cagagcagtc 



atttcctagt 
ggttccatta 
cagttaagcc 
ggccctcacg 
cctacactgc 
gggtctgatc 
ggaacaaatc 
ttgctaggct 
tggtgccatc 
tctttttttt 
catcagaaga 
aacagtgcag 
catctgcaga 
ggcaagccca 
actgaagcta 
gtccaatcag 
cagggcattg 
agcccatcgg 
aggctgggtt 
taatgacggc 
ggaagaacag 
tttcacacga 
acaatgagtg 
agggtcctgg 
ctcaaatata 
tggagagtta 
aggaccggct 
gggcagggtg 
gggttttttt 
tatagccctg 
ccgcctgcct 
ggccgagtcc 
ttactgtgat 
actgggcact 
gaaagttggt 
atctttcttg 
tcaaggaaga 
atatagctag 
taaactcagg 
tgtttccagg 
ttagcgctgc 
gcagagcacg 
agtagaagca 
cccacttgaa 
gctaagtggt 
gctagggaga 
caacctctat 
cctcacgaat 
atttggtctt 
gctccccttc 
tatggacacc 
gagtcttatc 
ccgactgctt 
ctgagaattg 
ttaagacaag 
tgcctcaagg 
ctcttagctt 
actcccttcc 
gtaaagccat 
agagcagcgg 
cccccccccc 
aataaaacca 
cactcgcaca 
cttgtaacaa 



cacccgcact 
ggagcacggg 
cgctaacacc 
ccacgagccc 
aaatggacct 
cttcaagagt 
ctcctcttaa 
aactctgata 
ttctgaattg 
ctcttcttaa 
gagctcgctt 
gaggccagag 
aggctagccc 
gaagcctcta 
taaataagcc 
agcttcattt 
agtagagcct 
ttggttacct 
atctgcattt 
acaaacttcc 
ttcaccatta 
ggcaggtggg 
ttttctcaag 
taatgcagaa 
ctcctaaggc 
agacaaataa 
acctccaaca 
aattaggggt 
tttttttttt 
gctgtcctgg 
ctgcctctgc 
gtcttgataa 
gactactgag 
acacgactgt 
tggaggctag 
atccaaatcc 
ccactgactc 
gagattttaa 
ggtgggccca 
gatctgctta 
tcggggcacc 
cacggctcag 
gctggcaaga 
gtgatccaga 
acaggagggt 
taacccagga 
ttatacacac 
ctgcaaacat 
gcaggaggaa 
cagttaactc 
tggtcctcta 
atagcaatga 
gggattatgg 
aaccacttta 
atctcattaa 
cgataggatc 
tctacagtga 
ctttaagtga 
ctccccacct 
ctctcagcct 
agccgtagaa 
gtctgggaga 
aacctgtctt 
gtgcaaagga 



190500 
190560 
190620 
190680 
190740 
190800 
190860 
190920 
190980 
191040 
191100 
191160 
191220 
191280 
191340 
191400 
191460 
191520 
191580 
191640 
191700 
191760 
191820 
191880 
191940 
192000 
192060 
192120 
192180 
192240 
192300 
192360 
192420 
192480 
192540 
192600 
192660 
192720 
192780 
192840 
192900 
192960 
193020 
193080 
193140 
193200 
193260 
193320 
193380 
193440 
193500 
193560 
193620 
193680 
193740 
193800 
193860 
193920 
193980 
194040 
194100 
194160 
194220 
194280 
gagagagagg
aacacaagtc 
attaattatg 
tatttatact 
caatttgacc 
aatggaaggc 
tgccttttac 
tagagcccct 
attgagctct 
gtccaagaga 
aatccctcac 
tttttgagta 
ccacgggttt 
ggattggttc 
tgaccgaatg 
ccatctctaa 
aacgctttgt 
gaatcaagtg 
tactaaacca 
aaagatgcca 
ggagacagaa 
aagctaccca 
cttgtatttc 
acatagggaa 
agggtgcatc 
gtgagggccg 
tgggtggccc 
gtttccaagt 
ccagtatcaa 
acacacagag 
ggctgccaca 
cagcagcagc 
agcaactgaa 
tgcacacatc 
gcagagatgt 
ttaggaacac 
gttttcttcc 
taaacatcaa 
ccttttgtgt 
atctccaggc 
tatttattat 
aagtaggatc 
tcttaaattc 
ggtaactgat 
tgtctgcagg 
caagtaactg 
ttccccaagg 
taatatcaga 
cagatattaa 
caaccatgca 
cacctgggtt 
cactggaaaa 
acttgtgaca 
aggcaggagg 
acacacagcc 
tgggttgcca 
aagccattct 
tcacatgttc 
cttcacgtcc 
tttgtgagtg 
tcaatgagtc 
tcctggagtt 
tcagacctag 
agcagacagg 



ctgagtttct 
aatataaacc 
tagtcaatag 
tttcgaagac 
ctgtagttca 
ataaaacact 
aaagagactg 
ccctcccctc 
aagtgataga 
attcaaagaa 
aggcgttaac 
gggaaatctg 
ctacatgcag 
agcgacctca 
ctgtgagcag 
cctgcaattt 
aagcaaggtc 
taactcaatc 
gtgagcctca 
acagctaagg 
acaggaggat 
cttgaagact 
tagtgtatga 
gatggtgcct 
gatctaaatc 
ttccagagat 

aggggtccag 

attaggacca 
gcttcaggcc 
cactagcatt 
gtgagagtcc 
catttctcca 
acatggggtg 
acaggccagg 
gcagcagatg 
actgagcagg 
tcgatcacgg 
tccaaccccg 
tttcattcat 
ctcagtctaa 
gaaggctaat 
cacaaagata 
tatttaacat 
gcaaactgaa 
ctttgtcgga 
accccactat 
ggacagcaag 
gtcatctgac 
cgataggcct 
gtaactctgc 
ctcccctact 
gcacactcca 
taatttctgt 
aggggactgc 
ctctgagctc 
gcctccccta 
atttctgttc 
ttcctgatgt 
atcacagtgg 
gcagagcctg 
cagcaggaag 
gggccagatc 
catcctcccg 
taactcaccc 



acttctataa 
tttagacatg 
ccatgggttt 
aagccactca 
aaaaaagtca 
ggaaatatgg 
gagggaattc 
agctgcttaa 
tgagaatcag 
ctgtgggcca 
agtggaacca 
ttaaacatcg 
ctgtgctaga 
gaggacattc 
gagatatcaa 
gtcaatctca 
agatagaggc 
cctatctcct 
agcaaagcct 
gccagggatg 
caggacttca 
gtccccaaca 
gacgaaggaa 
caaaaacagg 
cacatacctg 
ttaataaaga 
actgaataaa 
cagcctggcc 
agcggcaggg 
cacctcctgg 
cctaccttcc 
gtaacttgct 
ctgtgacccc 
cctgaagatg 
ccaagaggtg 
catgctcttg 
tctccacgtt 
aggggccaga 
ttaaagatgc 
tctctgagtc 
taattgcttt 
tactgtttat 
agctttaata 
gccatgctct 
tttactaaga 
gtggataatg 
ccagtgctta 
tcagtctatt 
ttgattaata 
aaaacagacc 
cttctccccc 
ggttacataa 
ttgaagataa 
tgccacacta 
actgtctttg 
gaggatcttc 
ttctctcttc 
tcgcgatcag 
ccttaattcg 
gcgtgcagcg 
aaggaaagtt 
ctcttccaca 
gagaagcctc 
taaagcatca 



ataaaccctt 
gggctgccaa 
cattagcgta 
gggaaaaaat 
gaacagcaca 
acagaaatca 
agaaactatt 
cactggggtt 
aacaggaaca 
gaatctactt 
atttccaagg 
tcccacgagg 
tctgctgaag 
ttgttcacta 
aggccggcta 
gacagcaatg 
tccataaaaa 
gagattaggg 
gtctgttctc 
tagtgcaggg 
aggtagtttg 
aaacaaacaa 
gaaagctcaa 
acagccgagg 
gatttaaagt 
cccaccctga 
gggaaaatgg 
cctggcatgt 
cactggacag 
tctcttcttg 
tcaccgttaa 
ttccacagat 
ttggcaccac 
ctgggggact 
ggctgcagca 
ccgaatgaaa 
tcagagttgg 
tcatcggtgt 
attccagggt 
tgtaatgagt 
ccagttacaa 
tcaaacaaag 
aaggtacaca 
gtagcagcct 
ttctgttatc 
aagtaaatta 
tcagccgtcc 
aaacctatca 
attctacctt 
ctttgattcc 
caggaggaac 
tttgcctcat 
caaaatttca 
ccggtggctg 
cttatcagtg 
attgtggagc 
cgttctggcc 
ccgtctgcca 
agattcatcc 
gaagagagaa 
tacaagtctc 
gaccctttcc 
tgtctttcta 
ctttcatcta 



ggcaggcgga 
acttcacttt 
ttaaatacca 
ggtgggggga 

ctagagatta 
gatccctgcc 
taaaataaag 
gtggtggacg 
gtgtttttga 
aggcagtcct
cagccctgct 
gagcccagct 
tggccggtga 
gccctcgtgc 
ctggactgaa 
aagactgtga 
ttgttcaggg 
aagggaagga 
agcaaggtga 
tgctgtgata 
ggctataaaa 
gctgggtatg 
gcccgagacc 
agcagacaga 
aatatctggg 
ctgagtatgg 
gaggaagttc 
gctggccagc 
ttcccacaca 
acaacagata 
ggatggtaca 
actgttatag 
aaagccatgg 
gcaatgctgc 
accagagata 
agcctcgcag 
cttggtgtta 
tcctgggctc 
tgcaaacatt 
taacatcttt 
gaatccttta 
caaaggaaac 
ggtccgcctg 
ggatgtccca 
ttcaaacagg 
tgcaatttgg 
tcagaggaga 
aaccctgaag 
gttgccattc 
aggcagacgc 
tcaagacaaa 
tatccagagt 
tgaaatccga 
agaactggag 
agtcccaaga 
tgtcccatgg 
accagtggta 
tagtctctga 
tttagaaaag 
ctttctttgc 
agagagaaag 
ccatcctagg 
tgggtgcagt 
gaggagctct 



tcactaaagg 
tcgacagtat 
cgatcaatat 
ggaggaggaa 
gcaagggttt 
ttcatttttc 
gcaaaatgat 
caaaataagc 
ggcaaaatat 
ctgggacccg 
ggtgatctga 
ctttcactcc 
ggaggtgtgg 
actggggcga 
aactagatca 
ttttctagtc 
ttcaggcaga 
aggctgtgtc 
gccacccacc 
tcaacagctg 
tataagcttg 
gtggtcgatg 
tgcttgggtt 
cagggcagac 
agactggtct 
gcaacaccca 
agctggtagt 
tagtctagct 
cgacacacac 
aaatgtaact 
aactgtgagc 
cactaagaaa 
caagctgaag 
ctggattctg 
attaatatga 
tgtaatgact 
ggctgccgcg 
aatcgccttt 
agtgagaatc 
ccctagtgaa 
cagtcaaaga 
aaagcttctt 
gcaaccgaac 
gtgccacctc 
gattgtgtct 
gggtttgctt 
caattctgat 
gaaggatatt 
taagcattaa 
accctctgaa 
aaggtgccac 
ggggttaatg 
caaagccgga 
cggaaggttc 
ggggcccaga 
ggcgggaagg 
cttgctccca 
agtccacggg 
agagaagctg 
ttcagtggct 
tgctgtgact 
tgccctgtgc 
gggggcccag 
gtggtagtag 



194340 
194400 
194460 
194520 
194580 
194640 
194700 
194760 
194820 
194880 
194940 
195000 
195060 
195120 
195180 
195240 
195300 
195360 
195420 
195480 
195540 
195600 
195660 
195720 
195780 
195840 
195900 
195960 
196020 
196080 
196140 
196200 
196260 
196320 
196380 
196440 
196500 
196560 
196620 
196680 
196740 
196800 
196860 
196920 
196980 
197040 
197100 
197160 
197220 
197280 
197340 
197400 
197460 
197520 
197580 
197640 
197700 
197760 
197820 
197880 
197940 
198000 
198060 
198120 
ggactgaggc
tccatcccag 
ccagcctacc 
ggtgataacg 
cacttgtctc 
cacaagtcta 
ccgctgttta 
ctccatggct 
aagcttgcaa 
ctagatagat 
ctggacatca 
tttgtttaat 
annnnnnnnn 
nnnnnnnnnn 
ctcctgctgg 
cacagtgtca 
gaactcgagt 
gattccctgg 
tgtaccgatg 
gagaacagca 
gagggggggt 
aagttccaca 
agaaacagat 
ttttctcctt 
tatatctaag 
cattatggat 
agaacagtca 
tgagttaggt 
caatgatact 
ttgctggtct 
gttatgtacc 
gtttctctgg 
ctcagaaatc 
cccagtttaa 
ctggatgttc 
cctctgcctc 
ttttttaatg 
cccagagcgg 
ttttaaaggc 
cacacccgta 
tgctggtctg 
cagtttgagc 
gcagagaagc 
aacaggaaat 
cttctctact 
aaattaaact 
gcctggttct 
gcctcactag 
tggcctgttc 
cagaaccaaa 
ctcttcagca 
acgtgcggtt 
tggagctcag 
tgcggagggg 
ctgcacccta 
cactgcgagg 
aatggcccag 
cccttcctca 
tgatgcccgg 
gcgttggctg 
gggtgaagca 
gactcgagtg 
cgtgcccttt 
ggctgcccag 



ttctgctcca 
gaagacagaa 
tatgttcatt 
cagaatgcac 
cttgaactaa 
gctgggggcc 
caagtatggt 
ttaggttcag 
caacagctcc 
ggggcaggac 
ggattagttt 
attctgagtt 
nnnnnnnnnn 
nnnnnnnnnn 
ggttacaggc 
agctggatat 
tggtgaaata
tcacatgctt 
catgcttgca 
agtgaacaaa 
gtaaagctct 
tgtaaggtca 
gcagtctgtc 
cctttctttt 
tacactgtag 
ggttgtgagc 
gtgctcttaa 
tttgtgtagc 
ctgcctgatc 
ggaactaaca 
accacacctg 
gtagtcctgg 
cgcctgcctc 
gggttttttt 
tggaactcac 
ccaagtgctg 
tgtgtatttg 
aagtgttctt 
accaaactct 
cacacccgta 
gcctgtgtat 
acaggctctc 
atctgcaagg 
ctcaggagag 
tcccagaaca 
ccttggtttg 
gctgctggca 
atccgcgtct 
aggatgatct 
cgagcctcgg 
ggggtgggga 
gtttccgtgt 
cgctggtgct 
taagaaaacc 
ccccccgctc 
agggccgtgg 
gggagcagaa 
ggacctgaca 
cagctggccc 
gctagtctcc 
gaagtcagga 
gaggtgcatg 
gaggaggggt 
aattaccgga 



gctctgggca 
aatccctaca 
taatacaaca 
aaactctgcg 
acagtgtctt 
ttcatgcttc 
taggagcccg 
agtcattctc 
aacacagaaa 
ggactgtcca 
aaacacttgt 
ttagaaccta 
nnnnnnnnnn 
nnnnnnnnnn 
ttgagccacc 
gatggcacat 
gtccaccagg 
taaggaaaga 
gccacccaca 
caaacaagcc 
tggcaggtgg 
tgtgatcgag 
atatctaaac 
ttctctccct 
ctgtcttcaa 
catcatgtgg 
ccgctgagcc 
cctgggtggc 
tgtcttaatc 
gagatctgcc 
gagtttaagg 
ctgtcctgga 
tgcctcccaa 
ttgtttgttt 
tctgtagagc 
ggattaaagg 
tatgggtgtg 
aaccgctgaa 
taggtaggag 
cacaccacat 
gagctggaac 
tggtttctgt 
ctgtgaccac 
tatttccttt 
ttgtgtgcca 
cagagtagca 
gagctgacac 
cagagtctgc 
tcctctgcat 
acgaagcccc 
caagagcaag 
ctgggagatc 
tcgcgctccc 
cccaggcttt 
ctcgttcatc 
ccaggctcag 
ggcggaaagt 
tggaggagct 
tggcgccggg 
gggagcgccg 
ctttactgca 
ggctgcggaa 
cctgtagccc 
acctgtggcg 



aggttacttc 
ctctcccttg 
actaaaatat 
gctgcagggg 
tgaggcagaa 
catgtgctta 
actgcctggg 
tgcatgcctc 
gtgctgtgag 
gtaagcaggg 
caggtggggc 
aactgtggga 
nnnnnnnnnn 
nnnnnnnnnn 
atgacagctt 
gcttgcaata 
taaaagagct 
acttgccgaa 
cacccacaca 
gggggggggg 
ctggctcctg 
tacatctggg 
cattgttgtc 
ttctctttta 
acacaccaga 
ttgctgggat 
atctctccat 
gttaccttaa
attttgagat 
tgtttctgcc 
gttttttgtt 
actcgctctg 
gtgctgggat 
tttcctgaga 
aggttagcct 
cgtgtgccat 
tgggtgcttg 
ccatctctct 
actatacaca
gaccatgcct 
caaaaccttt 
tctctgtcct 
gctggctggt 
taagaacgcc 
agtggcaagt 
tgggagcatt 
ttggctaaag 
aggagaaatc 
ttaagggcgg 
ctaaaggcag 
aggcgggatc 
acatgacccg 
cgccctgctg 
ctttcctttg 
ccagtcttcc 
ccttgcgctg 
ggttcttaca 
gctccggagc 
gggcactcat 
actgggaccc 
acgttcggta 
acggagactg 
tgaaactctc 
ccatgcatac 



tctgctcttc accattcctg 
atctacccga ctttctgaca 
ctattcacag gcactaagct 
agacggcaga gttcctcctc 
cagggtgaca cctagggaca 
gtaattaatt actacatgca 
ttggcctctc gcctctgcca 
tgcctgtctc tccgttggta 
ggtcgacagt ggatagatgg 
ttcatcatgg ctatgcagct 
acttttacca gcacgtgcta 
aacaagagtc cacacataac 
nnnnnnnnnn nnnnnnnnnn 
ngccttgaac ttgcttctgc 
tagcaatagc tttgtaaatc 
ctaacctcca aagattccct 
tgctgcccaa gcctgaatct 
gatgtcctct gagtgccacg 
agtgcactgt ctcacacagt 
ggattgtgac cagaataatt 
gtaacactcc ataagtgggg 
cctccaacag tccttgnnga 
gtatctctgg gtagtctttc 
aaaaattatt tatttattat 
agagggtgtc tgtcagatct 
ttgaactcag gaccttcaga 
cccccaaccc ctttctcttt 
ctacactggc tttgaacttg 
agggactcac tacatagcct 
ttgcaaatgc tgggaataaa 
tgtttgtttt tcgagacagg 
tagaccaggc tggccttgaa 
taaaggcgtg tgccaccacg 
cagggtttct ctgtgtagct 
tgaactcaga aatctgactg 
cactgcccag tgattttttt 
tggaagccag gtgtcagatc 
tccctcttcc ctaactctga 
cacacacaca cacacacaca 



gagcacacaa 
gtcgggagat 
gtgtcgcatc 
gctctgccat 
agacttttgt 
tattaaccaa 
gagagggtgt 
ggctggcatt 
agagagggga 
ctggtttgcc 
taggagagac 
tcgcccggcc 
catcagctga 
cgccccggag 
tcgctggttc 
cggcctggca 
cccccaggcg 
gcagggtccg 
gtggagagag 
gtagtggccc 
tgtccccggg 
cagaggctgc 
tcctacccgc 
actcggttca 
cacacttatg 



gtggttttat 
ccgcagtctg 
ttgactagag 
ctacattttc 
gcctgggcca 
gtgctttgga 
atgcctaaag 
tctgagatga 
gaaggtccag 
cacgtagccc 
tgagccttgg 
ctttagagac 
cccgtcacgg 
cgcaggaccc 
gcgcagtcac 
ccccggaagc 
gccaggacca 
agggctggtc 
atctgaacat 
tagtgtccac 
ctgagggcct 
ccccaggctg 
tgggtggagg 
tgcaggaggt 
gacagcctta 



198180 
198240 
198300 
198360 
198420 
198480 
198540 
198600 
198660 
198720 
198780 
198840 
198900 
198960 
199020 
199080 
199140 
199200 
199260 
199320 
199380 
199440 
199500 
199560 
199620 
199680 
199740 
199800 
199860 
199920 
199980 
200040 
200100 
200160 
200220 
200280 
200340 
200400 
200460 
200520 
200580 
200640 
200700 
200760 
200820 
200880 
200940 
201000 
201060 
201120 
201180 
201240 
201300 
201360 
201420 
201480 
201540 
201600 
201660 
201720 
201780 
201840 
201900 
201960 
cagccacagc

gagggtgtat 

cagggatggg 

ggctttgctg 

gcatgatgta 

tcttctcttc 

ggctctgtct 

cctgagcgct 

tgggacaggc 

gaaagaactt 

cctcatgcag 

gctgccctgg 

caagttccga 

ggcatttgtt 

tgacgtgctc 

gctctgtgga 

gatgcagacc 

ttgctctatc 

ggaatttgtg 

tcactggatt 

gaatgtgtgt 

gctatttgat 

tccactcatc 

ttcaggacaa 

tggctggact 

tctggattcc 

agcatcacct 

caaagctgcc 

cttcagtccc 

cctagggagc 

gagcctcttc 

caccagggtc 

gggtctctgc 

actcctacag 

actgtttgag 

cagccccttc 

tcttttatat 

tctgtggcag 

cctgcctttc 

cctatttgaa 

gcctctcatc 

cgactgtttt 

gctgccccat 

cctggtcggg 

tgcctttggg 

cccagactac 

cttccaagct 

cctgggccag 

tggggatgat 

ggacttgaag 

ggaggaggac 

gctgtccatg 

gccgctgaca 

ctgagggggc 

accagatcca 

ctgtgtgtga 

aaagccatac 

tactcgaggg 

aagtggtggt 

aagagtgttc 

gagaggcagg 

ggtgcaggcc 

gccagcctgg 

tcgaaaaaaa 



actgccccct 

ggatgcacct 

ccctgcccct 

gagtcgcccg 

gttaccttca 

cgtgttctga 

ttgcaccaca 

tacgagatgc 

attaagtctg 

cggggccttg 

ctgaatcggt 

gtggtggact 

ctcaacaagg 

gcaggtggtg 

tctgacatca 

catgtccgag 

tggacaccgg 

caccctgaca 

gctgcccatc 

gatcttacct 

ctgcacctgg 

cagccacacc 

ccccggctgt 

cttataaatg 

agagataggc 

atctccctcc 

ggcctgttgt 

gggttggacc 

atacaggcct 

cagttggagg 

catcgagaca 

cggatactgc 

atacgccact 

ctgagcggac 

tataggccgg 

agctccgtgg 

caggcccggc 

cagctgggtg 

gtgctgtcgc 

cccgttgcca 

ggtgcctatg 

gtggcccagt 

gtcctccagg 

accactgagg 

gaagagattc 

aggtcgggcg 

ggactctacg 

ctgagtgata 

ggcggtgccc 

cagagcgaag 

caggaggatg 

gagacggtgg 

gagcagacag 

atgggtcagg 

ctgtagcacg 

gtggaaagtg 

cttgggctga 

ccctgggtgt 

tgagatgtga 

cagacaggga 

gaaaggctaa 

tttaatccca 

tctacaaagt 

aaaaaaaaaa 



cagctctacc 

tcttgccagt 

ctcggggcag 

agatgctcta 

gccctgccaa 

gggccatgga 

ttgctgtaga 

cttccgagga 

aaaaagaggg 

tgctagactg 

tggcaggtcg 

ttaccacacc 

gagataagca 

caggaagtgg 

cgtactatgt 

cgcagtggga 

atgagtgcat 

tgcccgacct 

gagccctcct 

ttggctacaa 

tggacgctca 

cccaacgcct 

tggtccagcc 

gtgcgggcag 

ctgggacagg 

ccgggaaagc 

ctttttctgc 

ctggggaggg 

tggaagagct 

agcctgaaaa 

tgcaggtcct 

agcctgatgc 

ccaaggacat 

ccaaaagtcc 

tttcccaggg 

tccccttccc 

gtgtggagga 

cggtgttaaa 

tcatgtctga 

aggccctggg 

agagcccctg 

tggtggtgcg 

tactggctgg 

atgaggaaag 

agatggatgg 

tcagcttcca 

tggctgaatc 

agagcagtac 

ctgcggacaa 

gctctgagga 

aagtcacggg 

tggctcctgg 

aaggcaaaga 

tgcttttcct 

cacagccaca 

gggcttagat 

ggctgggagt 

cttatctcta 

atggagccag 

acagtgtgtg 

aatggttttc 

gcactcagga 

gagttccagg 

aaagagtgta 



tgccctagac 

gggtgaatcc 

ccctgcctgc 

tgtggtacac 

gctgaccaac 

tgcctgtcac 

cgagaagcta 

tgaaaaccag 

ggaagggaga 

ggtccatggc 

acggcagggg 

ttatgggcgc 

attggacttc 

ggagccaccc 

atacaaggcc 

accccacgag 

acccgagttc 

ggatgtgccg 

ggagagctgg 

actccagggc 

cacccatctg 

ggctggatct 

tattcgggag 

gcttgtcgta 

agaagatgat 

aggtgaccag 

accctcgggg 

tgaagagggc 

ggagaaagtg 

gcctcacgcc 

gggtgtcctg 

acctttgtgg 

ccccgtgtct 

catggtgtcg 

attaccccca 

tccatacttc 

tgaggtccag 

tgacatcact 

ggagcacacg 

ccccaaaaat 

ccgcctgcat 

gctgggcttg 

ggtggaggct 

tgagctcccg 

gcagccggct 

tgaccaggcc 

tccacagccc 

cagcgaagcc 

gaacagcgtc 

agaggaggag 

aacatccgag 

tgatgggaga 

acaaaagatc 

tcaggctctc 

gtcagacaca 

ttagctttca 

ggagttgagt 

gccccagaag 

aactggccgg 

tgtacctctg 

ttaagagagt 

ggcagaggca 

acagccaggg 

gaagggtgga 



tctatacgac 
atcccatgtc 
cccagccttt 
ccttatgtgc 
agccaagcca 
cgccaggggc 
tgcagtgagc 
gagggctctg 
actgagtgtc 
cgaatcagca
gatcccaact 
ttccgagacc 
acctatgaga 
catgttcctc 
cgtcgcacac 
tatcctgcca 
tacacggacc 
gcctggtgca 
gaggtgtccc 
aaagaagctg 
accagctatg 
cctgccctgg 
gccacaggcc 
gaggccactc 
ttagaacagg 
ccaggctctt 
tctcgaccag
aagattgtcc 
ggtaacttcc 
cagccacctg 
ttggctgaga 
gtacgctttg 
ctgcagcctg 
aagaagggca 
cccagcccag 
ccagcactgc 
ggtcgggagc 
cccgagggct 
gctgtgtaca 
gccaacaagt 
ggccgcttct 
caggccttcc 
tcccaggagg 
gtgtccgggc 
gcttcctcag 
gacctgccgg 
caggaggctg 
tcccagggcg 
aagtcagggg 
gaggaaggct 
ctcactctgt 
gacagagaag 
ctccttggtg 
atatgctggg 
gtgcatggaa 
ggagacagaa 
ggtagagcac 
aagtattaag 
aacagtcggg 
aggctctcat 
ccagaagggc 
ggcggatttt 
ctatacagag 
agccagggac 



aagctctcca 

tatcaaatgt 

tgcgagctga 

aattctccct 

aggtgctctt 

tggcctgtgg 

tccggctgga 

aagagaaaaa 

ccacctgcca 

acttccacta 

atcacccagt 

ttcgtaaatc 

tgacccggca 

accacatctc 

cgcgctcggt 

ccatggagcg 

cctctatctt 

gttctaacca 

aagacctgca 

tgaaggagaa 

gcgtggtaca 

cccctgaacc 

aggaggacat 

catgtgagac 

ctacagaagc 

cctccagtca 

gccgtaggag 

ttccagaggg 

tggccaaagg 

tgcacctgca 

tggtgtttgc 

aggctgttcg 

tgctagacac 

agctagaccc 

cccagctcct 

acaagttcat 

tggcgtttgc 

tagagatcct 

cagcctggta 

acctcctgaa 

acctgtacac 

tcacccacct 

agggcaaagg 

ctggctcctg 

gactggggct 

acacggagga 

aggccgtgag 

aggagagggg 

acagcagcca 

gtgtggtgtt 

ctgacacgat 

aggaagagga 

agcccgtggg 

tgtgggtcca 

tgtggaagtg 

agctccttta 

ttgtctggta 

aaataaaagc 

tggaagtggg 

ggttccatca 

tgggcttggt 

tgagttcgag 

aaaccctgtc 

aagtctgtac 



202020 

202080 

202140 

202200 

202260 

202320 

202380 

202440 

202500 

202560 

202620 

202680 

202740 

202800 

202860 

202920 

202980 

203040 

203100 

203160 

203220 

203280 

203340 

203400 

203460 

203520 

203580 

203640 

203700 

203760 

203820 

203880 

203940 

204000 

204060 

204120 

204180 

204240 

204300 

204360 

204420 

204480 

204540 

204600 

204660 

204720 

204780 

204840 

204900 

204960 

205020 

205080 

205140 

205200 

205260 

205320 

205380 

205440 

205500 

205560 

205620 

205680 

205740 

205800 
aagaaggaac

ccagtttctg 

ttgctagcag 

ttatctgatg 

tggctgtctg 

cgcctgctga 

ccagccaggc 

cactccctga 

aagagcccag 

ctgccctcca 

gttcaccgtc 

gccagtccta 

cctgtatgga 

tccctggttc 

gtcttggggg 

tctaggtagc 

ggctgctggc 

tcatggacat 

cctccttcgt 

ccaggaaggg 

ttgggggtag 

tctgagacta 

tctgacttaa 

cactttccca 

tgggggctgg 

ggttagattc 

ccaccctcct 

aaaaaacacc 

ctctgactaa 

agccttacat 

aagggctggg 

gtcctatgcg 

atggtccagc 

catgagcttc 

ggccaggcca 

ctggatccta 

cgacgaccag 

gcgtacacaa 

cacagagttg 

tcagcaatgg 

ctttaactcc 

ttgctcaggc 

tgtgagtgtg 

catctctcta 

ttttgtagac 

gctggccttg 

gtgtaccact 

cctcccctgt 

cgtaggaatt 

ttatctatct 

acttggggag 

gcaggaactc 

tggcttgctc 

tatgcctggc 

cccctgcctc 

gggtggcact 

cctccatagg 

ctgtagctga 

accaaacaaa 

cagcaggaaa 

gatggtctcc 

ttatgggatg 

gaatattttg 

tcattgtgtg 



ttgggagcat 

gggaccttga 

agccacagcc 

agctcaggac 

ccaagcttgg 

catcttgtta 

ttctgcccat 

gcagggcctg 

tattgaatga 

ctggcttctc 

agcagtgatg 

ggtgacatcg 

gaacccgttc 

gtcaaacccc 

ttccaggagg 

cccagggagc 

agcggtgaca 

tctgccccgc 

cacagggtag 

ctaggaagct 

tggcacaggt 

gcctggtcta 

gaaaagagtc 

gcatgcagga 

aaagaatagc 

ccagctgcca 

tcggtctctg 

catacacata 

gctacactgc 

gcagctcctg 

aactcttgca 

tgaaaaccat 

agcacctgag 

ggcagcaggt 

gggcagtgga 

agggctgtac 

tggaccccac 

tctacgtacc 

gtgactacat 

aatcttttta 

agctccttcc 

ttcttncatg 

gaggtgacag 

gatccagttt 

agggtctctc 

agctcacaga 

cctgcctggc 

gctgggatta 

ttaggatcag 

tagttaggat 

gaaaaggttt 

gaggcagaag 

agtctggtgt 

tgtcctagaa 

tgcatctcaa 

acccactgta 

ccaatctggt 

tgccaaattg 

ccaaacaact 

ggctcaggag 

agaagctacc 

ggcgctgtac 

ctcctacaga 

tctgatgcta 



tgccgaaagg 
ccatggctag 
gagttggctg 
cttttcctgc 
ccccacagta 
tgttggtaag 
ccttagccct 
ggtctcaccc 
atagaagcca 
aacgctgctg 
acacccctcc 
tgtcggggcc 
tcacctacca 
ggcttggggg 
tctgtggggt 
aactcaaacc 
ctgacgcaga 
attagccacg 
gcccctgctg 
cagggagaag 
ctttcattcc 
cagagagagc 
gtggttcatt 
ggagctatgc 
tcagggttta 
catggtagct 
tgggcactgc 
aaattagacc 
ttccctgtgc 
ttatttgaag 
ctttgattct 
cagtctcatc 
tgagccagtg 
aggcaggcag 
cccactgaat 
tgagggccag 
cctgctggaa 
tttctcctgc 
ctcttccctg 
tttttatttt 
agcttccacc 
ttcttatttt 
aataacagtt 
ttggttttgt 
tgtgtagcct 
gattcacctg 
tttgtttttg 
caggtgtgtg 
agtgaccaga 
ctctgttgaa 
attttgtctg 
ctgaagcaaa 
accccctccc 
ctcactctgt 
gtgctaagat 
aattggtccc 
gggggaattt 
ataaaacaaa 
aaccaagaca 
ctggccattc 
tgcaaaggaa 
tgggcttctc 
attgtaagtt 
gaatggtgcc 



atgacctctc 
gtgaatggac 
ggtggggtgg 
cctgcagata 
gcctctcgcc 
gtctgtggtt 
ctctaggcga 
attaagctgg 
ccccacagtc 
cccttccttg 
actgaatgcc 
tgtgctcagc 
gtacctgccc 
tgggggcaag 
gacctgtccc 
ccagccgact 
aaatcatcgt 
aggtcttgct 
cttgggagag 
cagataccgg 
agcacccaga 
tccaggctat 
ctgggttgtg 
ttgagttcca 
agagcactgg 
cataaccatc 
aaacatgtgc 
aaaaaagttc 
ctcagtttcc 
ttcctggtaa 
aggttcccca 
gccctcatct 
gccaccttct 
cttctgggct 
ctgtggtctt 
ctgccagagg 
gagctgcaga 
ctgttgggta 
gggtgggccc 
gacatggggt 
gtctcctgtt 
taaatgacct 
ggggggtcag 
gggcttttat 
ggccatcctg 
cctctgcctc 
ttaaccacca 
ctatcacacc 
tttggtccta 
aatcatggtg 
acaactctca 
agccatggaa 
cacccacctc 
agactaggct 
taaaggcggg 
ttcccatatc 
tctcagttga 
tctcaaacca 
gtgacttata 
aagtctgggg 
tgaacagctt 
tctgagtagg 
cccagaggca 
tggcatacac 



tgcaggtcct 

ccaggatggg 

ggtggggtgg 

cagcctgcaa 

atgtggcccg 

agtgctggag 

ctccttccct 

gttttcttgg 

tcagaagggc

gtagggccca 

ggcaacatct 

tgcctcctcc 

tacatcagct 

gatccaagga 

tccctcatct 

gaacagccgc 

atacctctct 

gcctgtgctt 

ccacctggct 

cctgagtcat 

ggagggcaag 

ctaaggctcc 

ggtgtggctt 

gcccttcaga 

ttgctcttcc 

cggcagttct 

acagacatac 

atgttctctc 

tcccctggtc 

attggtcaag 

gtggggccca 

gcttgcgcat 

tccaagtctt 

gggtgggcca 

cctacccgca 

cgaccttctc 

aggtgttcac 

ttgcccatca 

cgatgctttc 

ctcatttagc 

ggcattgtag 

gtgtgtgtgt 

cagatgcctt 

gtgtgtgttt 

gaactcattc 

ccagtgctgg 

tcctcctgcc 

cagctaacag 

gggcccaatt 

gaacatcatt 

ggtcaccaag 

gaactctggc 

cccacaatgg 

ggcctcaaac 

tgccatcacc 

agttgttaaa 

gggtttctct 

ccaccaccaa 

aagagaatct 

aacagaatgt 

gctgggtttt 

atgggccacc 

ggacacatct 

gtgtgtgtct 



gcccgaggag 

atggtcaggc 

gaagggtgag 

gatggtccgc 

gaacctgctg 

accaggttcc 

aacttcccag 

gtaagtgggg 

ggcttccctc 

ctcgacagca 

accagaagag 

acattgccta 

acctggtcag 

ccagccccag 

attctgtggt 

aaggaggccg 

gacacgaccc 

ggcttcctca 

gagggggccc 

ggttctgatg 

tttctgtgag 

atagtaagac 

ggtgatggga 

aaaacaaaaa 

agaggatcca 

atggaacctg 

atgcaggcag 

ctacctgtag 

tggactgatc 

tccttcaggg 

ggcccggact 

cgggcaggag 

ctcfccatctg 

ggccaggcca 

ggatctgcca 

tgatgggcag 

cctggaaatg 

cgttcctttg 

acctccagag 

ccaggctgac 

tcatgtggca 

gtgatatctg 

gcctgatgag 

gtttgtctgt 

agtagaccaa 

gattaaaggc 

tcagcatctg 

tggatttaaa 

tccacagtga 

accaaatgca 

ggaagtcagg 

ttgttcctca 

tttctctgct 

tcaagagatc 

cctgccccag 

taagaaaact 

tctcaaatga 

caacaataaa 

gaacattttc 

aggggaatat 

gtggcttccc 

ctgtagttgg 

gtcttattct 

ctatagagac 



205860 

205920 

205980 

206040 

206100 

206160 

206220 

206280 

206340 

206400 

206460 

206520 

206580 

206640 

206700 

206760 

206820 

206880 

206940 

207000 

207060 

207120 

207180 

207240 

207300 

207360 

207420 

207480 

207540 

207600 

207660 

207720 

207780 

207840 

207900 

207960 

208020 

208080 

208140 

208200 

208260 

208320 

208380 

208440 

208500 

208560 

208620 

208680 

208740 

208800 

208860 

208920 

208980 

209040 

209100 

209160 

209220 

209280 

209340 

209400 

209460 

209520 

209580 

209640

agcactcatg tctacgtatc gataaaggaa gctgttttgg ggggaggaaa caggcttaca 209700

gacgagaact taataaccca gagtagccca gtcagtacct tgccttggct tctgttgttt 209760

ctaagctctg ggtagatagc taccttgcca tcttccctga tcttagaact ttccccactc 209820

ccctgtaggt gacatcatcc ggaaaatcat ccccaaccat gagttggtcg gggagctggc 209880

agggctctat ctggaaagca tgagcccgag ctctcgaaac ccagccagca tggaacccac 209940

catggctagt gccggccctg aatgggaccc tcagagtggg agctgtctcc aggacgatgg 210000

ccactcaggg acctttggga gtgtcctggt tggaaatcgc atccagatcc ctgactctca 210060

gccccagagt cctgggccac tgggctccct ctctggagtg ggtagtagcg gaggcctcag 210120

caacaggaat gaagacaacg ccctgaagcg ggagctgcct cggagtgcgc atgggctgag 210180

cgggaactgg ctggcgtact ggcagtacga gatcggtgtg agccagcagg atgcccactt 210240

ccacttccac cagatccgcc tgcagagctt cccagggcac acgggggccg tcaaatgcgt 210300

ggccgccctg agcagtgaag acttctttct gagtggcagc aaggaccgga ctgtgcgcct 210360

ctggccgctg tacaactatg gggacgggac caatgagacg gcttcccgcc tcatctatgc 210420

ccagcaccgc aaaagcgtct tctacgtggg ccagcttgag gccccgcagt atgtggtgag 210480

ctgtgatggg gcagtgcacg tctgggaccc cttcacaggt gagcgggccc aggtgaggcc 210540

tgttcgacgg ctgctttact gtgccttagc caggcctctg ggaacgggac ctagtgcgaa 210600

acgtacaatg gcgtattttg acggggaaga ttcagtgagg caggaagaga agaagagtca 210660

ggacttagaa tctgtgggac ccaagtttga atccactccc ccaacttacc agcaatcggc 210720

tcagttgctg caggcgtctg ccttctacct gtaagaacca aaaatttaga agattccacg 210780

agtatggctt tggcttcttg tacgacgtca cctgtcgtcg ttgtaaagag aagtatcgag 210840

tggaggaggg tcagggcaga cggaggtcgc agctagttag agcatgctat gtgaagagag 210900

cagactgttc tggggctgga cccttgactt cactgtggaa gcagcaagat gagaaagccc 210960

tgagattgtg ttttctgagg gtcactgggg aatgggatgc aggtgtgggg tgagttggag 211020

tttgaagtag ccagggctct ttgatagcca ctaagtcccc agatgtgtcc tttttcagga 211080

aagacccttc gcacagtgga tccttcagac agccgggtgc ccctgacggc tgtggctgtc 211140

atgcctgccc cacacaccag catcaccatg gccagctccg actccactct gcgctttgtg 211200

gactgcagga agccaggctt gcaggtcagg aggggtgcag ttcctgggct actgggggtc 211260

tctaggtacc agtcaggaaa gacactcagg ggactccacc aggaacgctg cagtgacagg 211320

cagccctgtg tgggtggggc gctggcacgg atggggcttt tctcttccgg ggatggagtg 211380

ggagggtcag gcctactggt ttcgtgggcc tgaatggggt gagctgcagt agggtgggtg 211440

gcagtgatgg atggcgacgg gcacttgaac acaatctcct cctatagcat gagttccgac 211500

tgggtggagg gctgaaccct gggcttgttc gctcgttggc cgtcagcccc agtggccgga 211560

gtgttgtggc tggcttctcc tcgggcttca tggtgctcct agatacccgc acgggcctgg 211620

ttctacgagg ctggccagcc catgaagggg acattctaca gatcaaggtg actgactgcc 211680

tgaggtccta tcctttcatt tctacttagg gcctggtctg ggagaggaca ggtttatgct 211740

ggtgtccctt ataactactc ggggacattc agtggggtgg gaaaatggcc ctcgtaggcc 211800

agctcaggaa ccagctgcac aggaggcagg ctaggggcag gaatcagggc tagaactgac 211860

cctgatgctc cacagcgatg ttctaatgag taacccttgt ccatatttgt cttgcttgga 211920

ggatcagggg tcacgccctg tccgtgaccc agttcaggtt aaataaagcc aggaggctgt 211980

ttactgcctg gagaccactg agcagagtcc atgccccctg ctgggctgtc ctgatggggg 212040

gcaggaacag gcgcaggcct gcgcatcgtg ttcctgcctc ctatattcaa tcatagacct 212100

cagagctcag caggttctgg gaggggagaa atagggctgc ttgtgggagg atttctccct 212160

gcagtgggaa ctctcctccc ccgccgtcca gatggaggtg aaagacaggc actgttgctt 212220

acagggaagg caggctgccc ccagctctat ccaggacccc aggggaccct gggtctcagt 212280

gtctctaaat cccaacattc taagaaagtg tcaggatggc tctggggtca tcctgggtgt 212340

tagtcccagc tctcggagtc ttctctgagc accagttctt ttcctatggg aagtaaggac 212400

atgccaggtg ttctttgaga ggacctgagt ttggttctca gcactgtcta gctctggctc 212460

cagggggttc aacacccttt atggcttctg tggacacata ttcttatgtg gcctgcacgc 212520

acacacacga acacaaataa aaataaatgt taaaagaaga cagcggcacc ttgtacctca 212580

catgttagta cgattggatg tggcagtgcc tcacagaatc tgtgggactt tatttattta 212640

tttatttttt ggtcttaaaa ttttaaaaga ttggtttagg agtggtggta cacaccatta 212700

atcgcaacac tcaggagcag aggcaggtgg atctctatgg gtttgaggcc agcctggtct 212760

acagagcaag tttcagggca gccaaggtta cacagagaaa ctctttctca aaaaataaaa 212820

acaaaatatt taaaagattt acttacttct tatttgagct aggatctccc tattagccct 212880

ggctgtcctg gaactcactg tatagaccag gctggcacct taaactcacg aagatcctcc 212940

tgcttctgcc tcctaagtgc tgagattaaa gtagtgttat accatgcccc actattttct 213000

ttatagattt ggtgttttgc ctgcttgtgt atatatgcac taccttcatg cagtgctaat 213060

aggggtcaga ggcgagtatc agctcttcct ggaactagag ttatggaagg ttgggagtca 213120

ccatgctggg actgggtcat ttgcaagagt tacaagtact tctgagccat ctccagcccc 213180

ctagagtttt tttcccccct ggctgtcctg aagtagaatc tgttcttgtt ttgttttgtt 213240

tttcgagaca gggtttctct acataggcct ggctgtcctg gaactcactc tgtagaccag 213300

gctggcctcg aactcagaaa tccgcctgcc tctggctctc agaatactgg aattaaaggt 213360

gtgcgccacc acgcctggct cagaatctgt ttttaaatga gagtaatagt tacaggtttt 213420

ttgttttgtt tttttctttt ctttttttgt ttttgttttt tggctcattt gttttatttg 213480

ttttgagaca ggtctcactc tgaaccccta ggtggcctgg agcttgctat gtagaacaca 213540

ctgactttaa acttgttttc tgagtgctgg atttatgggc ttgtgctatt ttgcccagcc 213600 

tctgatggtt gttaataaca atattattta gcttttcttt tggagatagg ctctcactgt 213660 

atatcaccca gacagtggct ggtctggaaa tcactgtgta ggccaggctg accttgaatt 213720 

cacagagatc tgcctcccga gttcagaaat taaaagcact ctgggatggt tttggagttt 213780 

ggtgagtacc caagcctcca ttgatgctat ctgtccctcc cgctctctgc aggctgtaga 213840 

gggcagcgtg ctcatcagct cctcttccga ccattccttg actgtttgga aggagctgga 213900 

acagaagccc acgcaccact acaagtcagc gtccgaccca atccacacct ttgacctgta 213960 

cggcagcgag gtggtcaccg gcactgtagc caacaagatt ggtgtctgtt ccctgcttga 214020 

gccaccctct caggccacca caaagctcag ttccgagaac ttccgtggca cgctcactag 214080 

tctggctttg ctgcccacga aacgccacct cctgctgggc tcggacaatg gcatcatccg 214140 

cctcctggca tagggccagc caggagttgg ctgagggcag ggcgagatga catctctcag 214200 

ggcccgctcc tcattcttga tctcgaagcc gattcttcta ggcaagcccc aggctctggc 214260 

tacccacatg gcctgctgtc tgggattgca cagctcctga atctccaaag ccttgaagtg 214320 

gcttcatgaa actcgggaga tactgttcct aaccagcaag aattggggca aggaaagcac 214380 

tgtgatcccc attgctcccc agttctgcct tctggattca catggggaca gggcagctcc 214440 

aggaaatgaa aggagttggg cctttgctca gccagcttcc tctagccacg ctctccttag 214500 

ctctgtttct cccttgggta ggaaactgct cctgtctagg gttctgatgg tactgggact 214560 

ccaggctcag gagggctggc caggacctac gactttcagg gcttggtctg gggttttagc 214620 

attcattcag ccaggtcttc agtatgggac cagaaaaaag gggatgtgag aacagggcta 214680 

gggaaggggt tatatgggcc cagctggtcc aggaatgaat ccatgccttg ccttggtacc 214740 

cctaaccaca gcgtttgtgc cttcagccgg ggaggcagcc cttgggacca gcatccctag 214800 

ggacaggagg cagcgggaat catctctgta tctcgggttc tgcccagggg atgggcagac 214860 

tctgccatct cttgagtgtt cgtttggaga agcctgagat gtggcccctg ctgccttctc 214920 

actagttgca gtctatgtaa ataaggtcaa taaattcttt ggaagagcca cggagctgag 214980 

tgaggctgtg ttgtgttttg ctttgcctag gctgggctca ggcagctctg cctcagcctc 215040 

ccaaggagct ggggaactgg tatatgtcac tgtatatgtc actgtgcctg gcttatggct 215100

tggcttggct ttttttcaga tggtctcaag tgcctcaggt tggccttgat cttgggatga 215160 

ccttcctgct tgaaacagag tagtgggctt ataggcatga cccaccaggt ccaattttta 215220 

ttttttaaag gcattgattt ttatacgtgt atggttgttt tgcccacttg tacatatgca 215280 

caccatactt gtgtctggtc cctgcggagg tcagaagagg gcatcgggat cacctggaac 215340 

cgaagttaat gaatggttat gagccacatc tcgatgctga agattgaacc tggatccttt 215400 

gcaagagcag ccagtgttct tacccactga gccatctcta agccccacac ccagcttctt 215460 

ttgatacaag gtctggtagc tcaaacttga tatgcagccg aggaggttga cctggtattc 215520 

cctacctacc ctcttctctc taccttccaa gtgctgatat tatacatagg catggatagt 215580 

catgcccacc agtttgcctt gatggcacca gagtcaggaa agtccaaacc tggtagttgc 215640 

aaacacagca agagggtaga ggcagccatt gtcctctggc tgccttggat acagagcttc 215700 

tgggttgggt ggccttgggt cagttttccg aatggttcac ccttggggaa agggaacact 215760 

gctgaagagg tgggaccctg ggagggccgg cctccagctg ggtctctcca gccctcgcct 215820 

tggaacctag gctggaggga gccaaccagg atcctggact tgctacagtt aggtgaacag 215880 

gctcctgcag cctccccttc ccttgggtag ctgtggtggt ggtggtggtg gtggtggtgg 215940 

tggtggtggt ggtggtggtg gtgggggggg gggngnngnt 215980 



<210> 17 
<211> 473 
<212> PRT 
<213> Mus sp.

<400> 17 

Met Lys Arg Ala Ser Ser Gly Gly Ser Arg Leu Leu Ala Trp Val Leu
1 5 10 15

Trp Leu Gln Ala Trp Arg Val Ala Thr Pro Cys Pro Gly Ala Cys Val
20 25 30

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gln Gln Gly Leu
35 40 45 

Gln Ala Val Pro Thr Gly Ile Pro Ala Ser Ser Gln Arg Ile Phe Leu
50 55 60 

His Gly Asn Arg Ile Ser His Val Pro Ala Ala Ser Phe Gln Ser Cys
65 70 75 80

Arg Asn Leu Thr Ile Leu Trp Leu His Ser Asn Ala Leu Ala Arg Ile
85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Thr Leu Leu Glu Gln Leu Asp Leu
100 105 110 

Ser Asp Asn Ala Gln Leu His Val Val Asp Pro Thr Thr Phe His Gly
115 120 125 

Leu Gly His Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Arg Glu 
130 135 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gln Tyr Leu Tyr
145 150 155 160 

Leu Gln Asp Asn Asn Leu Gln Ala Leu Pro Asp Asn Thr Phe Arg Asp
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg Ile Pro Ser
180 185 190 

Val Pro Glu His Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 200 205 

Leu His Gln Asn His Val Ala Arg Val His Pro His Ala Phe Arg Asp
210 215 220

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Met 
225 230 235 240

Leu Pro Ala Glu Val Leu Met Pro Leu Arg Ser Leu Gln Tyr Leu Arg
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp
260 265 270

Ala Trp Leu Gln Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Asn
275 280 285 

Leu Pro Gln Arg Leu Ala Asp Arg Asp Leu Lys Arg Leu Ala Ala Ser
290 295 300 

Asp Leu Glu Gly Cys Ala Val Ala Ser Gly Pro Phe Arg Pro Ile Gln
305 310 315 320

Thr Ser Gln Leu Thr Asp Glu Glu Leu Leu Ser Leu Pro Lys Cys Cys
325 330 335 

Gln Pro Asp Ala Ala T^p Lys Ala Ser Val Leu Glu Pro Gly Arg Pro
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Thr 
355 360 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His Ile Asn Asp Ser Pro Phe
370 375 380 

Gly Thr Leu Pro Ser Ser Ala Glu Pro Pro Leu Thr Ala Leu Arg Pro 
385 390 395 400 

Gly Gly Ser Glu Pro Pro Gly Leu Pro Thr Thr Gly Pro Arg Arg Arg 
405 410 415 



Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly
420 425 430

Gln Ala Gly Ser Gly Ala Ser Gly Thr Gly Asp Ala Glu Gly Ser Gly
435 440 445 

Ala Leu Pro Ala Leu Ala Cys Ser Leu Ala Pro Leu Gly Leu Ala Leu 
450 455 460 

Val Leu Trp Thr Val Leu Gly Pro Cys 
465 470